CUT THE 
KNOT 


CUT THE 


Alexander Bogomolny 


Cut the Knot: Probability Riddles 
Copyright © 2020 Svetlana Bogomolny 


Mathematics / Recreations & Games 
Mathematics / Probability & Statistics 


All rights reserved. No part of this publication may be 
reproduced, distributed, or transmitted in any form or 
by any means without the prior written permission of 
the copyright holder. 


Production, editing and typesetting done with the assistance of 
Wolfram Media. The compilation of riddles included in this book 


includes various user contributions to www.cuttheknot.org. 


Cover image design by Paige Bremner, knot image courtesy of Larvik 
Museum, Arts Council of Norway (CC BY-SA 3.0). 


Printed and distributed by Amazon.com Services. 


Contents 


Foreword by Nassim Nicholas Taleb vii 
Preface ix 
1 | Intuitive Probability 1 


Bridge Hands ¢ Secretary’s Problem ° Pencil Logo ° Balls of Two Colors I ° Division 
by 396 ° Fair Dice * Selecting a Medicine * Probability of Random Lines Crossing * Six 
Numbers, One Inequality e Metamorphosis of a Quadratic Function ° Playing with 
Balls of Two Colors ° Linus Pauling’s Argument ° Galton’s Paradox ° Solutions (p. 5) 


2 | What Is Probability? 13 


3 | Likely Surprises 15 


Balls of Two Colors II e Coin Tossing Contest ° Probability of Visiting 
Grandparents ° Probability of Increasing Sequence ° Probability of Two Integers 
Being Coprime ° Overlapping Random Intervals e Random Clock Hands ° Crossing 
River after Heavy Rain ° The Most Likely Position ° Coin Tossing Surprises I ° 
Solutions (p. 18) 


4 | Basic Probability 33 


Admittance to a Tennis Club ° Black Boxes in a Chain ° A Question of 

Checkmate ° Concerning Even Number of Heads ° Getting Ahead by Two 

Points ° Recollecting Forgotten Digit * Two Balls of the Same Color ° To Bet or 
Not to Bet ° Lights on a Christmas Tree ° Drawing Numbers and Summing Them 
Up ° Numbers in a Square ° Converting Temperature from °C to °F ¢ Probability of 
No Distinct Positive Roots ° Playing with Integers and Limits * Given Probability, 
Find the Sample Space ¢ Gladiator Games ¢ In Praise of Odds ° Probability in 
Scoring * Probability of 2” Beginning with Digit 1 * Probability of First Digits in 

a Sequence of Powers * Probability of Four Random Integers Having a Common 
Factor * Probability of a Cube Ending with 11 e Odds and Chances in Horse Race 
Betting ° Acting As a Team ° Sum of Two Outcomes of Tossing Three Dice ° Chess 
Players Truel ° Two Loaded Dice ° Crossing Bridge in Crowds ¢ Solutions (p. 39) 


5 | Geometric Probability 73 


Three Numbers ° Three Points on a Circle I e Three Points on a Circle II e Two Friends 
Meeting ° Hitting a Dart Board * Circle Coverage ° Points in a Semicircle ° Flat 
Probabilities on a Sphere ° Four Random Points on a Sphere * Random Numbers and 
Obtuse Triangle * Random Intervals with One Dominant ° Distributing Balls of Two 
Colors in Two Bags ° Hemisphere Coverage * Random Points on a Segment ° Probability 
of First Digit in Product * Birds on a Wire ° Lucky Times at a Moscow Math 

Olympiad ¢ Probability of a Random Inequality ° Points on a Square Grid ° A 

Triangle out of Three Broken Sticks ° Probability in Dart Throwing ° Probability in 
Triangle ° Solutions (p. 78) 


6 | Combinatorics 121 


Shuffling Probability * Probability with Factorials e Two Varsity Divisions ° Probability 
of Having 5 in the Numerator ° Random Arithmetic Progressions * Red Faces of a 

Cube ° Red and Green Balls in Red and Green Boxes ° Probability of Equal Areas 

on a Chessboard ° Random Sum ° Probability of Equilateral Triangle ° Shelving an 
Encyclopedia ° Loaded Dice I * Loaded Dice II * Dropping Numbers into a 3 x 3 
Square ° Probability of Matching Socks e Numbered Balls Out of a Box ¢ Planting Trees 
in a Row ° Six Numbers, Two Inequalities ° Six Numbers, Three Inequalities ° Tying 
Knots in Brazil e Tying Knots in Russia ° Guessing Hat Numbers ° Probability of an 
Odd Number of Sixes ° Probability of Average ° Marking and Breaking Sticks ° Bubbling 
of Sorts ° Probability of Successive Integers ° Solutions (p. 128) 


vi Cut the Knot: Probability Riddles 


7 | Conditional Probability 
Two Liars ° Three Liars ° Taking Turns to Toss a Die e Two Coins: One Fair, One 
Biased ° Probability of the Second Marble * Quotient Estimates I ° Quotient Estimates 
I ° The Lost Boarding Pass ¢ Lucky Contest Winners * Diminishing Hopes ¢ Incidence of 
Breast Cancer ° A Search for Heads and Its Consequences ° A Three Group Split ° Lewis 
Carroll’s Pillow Problem ¢ A Follow Up on Lewis Carroll’s Pillow Problem ° Sick Child 
and Doctor ° Right Strategy for a Weaker Player ° Chickens in Boxes * Two Chickens in 
Boxes * Two Chickens in Bigger Boxes ¢ Solutions (p. 162) 


8 | Expectation 
Randomly Placed Letters in Envelopes ° Expected Number of Fixed Points ° An 
Average Number of Runs in Coin Tosses * How Long Will It Last? ° Expectation 
of Interval Length on Circle ° Waiting for a Train ° Average Number of Runs in a 
Sequence of Random Numbers ° Training Bicyclists on a Mountain Road e Number 
of Trials to First Success ° Waiting for an Ace ° Two in a Row ° Waiting for Multiple 
Heads ° Probability of Consecutive Heads ° Two Dice Repetition ° Expected Number 
of Happy Passengers ° Expectation of the Largest Number ° Waiting for a Larger 
Number ° Waiting to Exceed 1 ° Waiting for All Six Outcomes ° Walking Randomly 
How Far? ° Expectation of Pairings ° Making Spaghetti Loops in the Kitchen ° Repeating 
Suit When Dealing Cards ° Family Size ° Averages of Terms in Increasing 
Sequence ¢ Solutions (p. 197) 


9 | Recurrences and Markov Chains 


A Fair Game of Chance ° Amoeba’s Survival ° Average Visibility of Moviegoers * Book 
Index Range ° Probability of No Two-Tail Runs ° Artificially Unintelligent ° Two Sixes 
in a Row ° Matching Socks in Dark Room ° Number of Wire Loops ¢ Determinants in 
Z, ° Gambling in a Company ° Probability in the World Series * Probability of a Meet in 
an Elimination Tournament ¢ Rolling a Die ° Fair Duel ° Solutions (p. 240) 


10 | Sampling of American Mathematics Competition Problems 
Five-Digit Numbers Divisible by Eleven * The Odds for Two Teams ° Cutting a String 
into Unequal Pieces ° Getting an Arithmetic Progression with Three Dice ° Probability 
of No Distinct Positive Roots ¢ Rolling a Defective Die * Counting Coins * The Roads 
We Take ° Marbles of Four Colors e Two Modified Dice ° Exchanging Balls of Random 
Colors ° Solutions (p. 264) 

11 | Elementary Statistics 


A Question about Median ° Family Statistics e How to Ask an Embarrassing 
Question * An Integer Sequence with Given Statistical Parameters ° Mean, Mode, 
Range and Median ° Solutions (p. 274) 


Appendix A | Dependent and Independent Events 
Appendix B | Principle of Symmetry 

Bibliography 

Index of Riddles 


157 


191 


237 


261 


273 


281 
285 
287 
293 


Foreword 
by Nassim Nicholas Taleb 


Maestro Bogomolny 


How do you learn a language? 

There are two routes; the first is to memorize imperfect verbs, grammatical rules, 
future vs. past tenses, recite boring context-free sentences, and pass an exam. The 
second approach is by going to a bar, struggling a little bit and, out of the need to 
blend-in and integrate with a fun group of people, then suddenly find yourself able to 
communicate. In other words, by playing, by being alive as a human being. I personally 
have never seen anyone learn to speak a language properly by the first route. Also, I 
have never seen anyone fail to do so by the second one. 

It is a not well-known fact that mathematics can also be learned by playing— 
just watch the private correspondence, discussions and pranks of the members of the 
august Bourbaki circle. Some of us (and it includes this author) do not perform well on 
tasks via “cold” approaches, unable to muster the motivation to do boring things. But, 
somehow we upregulate when stimulated or when there is play (or money) involved. 
This may disturb many people married to cookie-cutter pedagogical methods that 
require things to be drab, boring, and bureaucratic for them to be effective—but 
that’s reality. 

It is thanks to Maestro Alexander B. that numerous people have learned mathe- 
matics by the second route, by playing, just for the sake of entertainment. He helped 
many to make it their hobby. His mathematical website cut-the-knot has trained a 
generation—many seemingly approached the problem as hobbyists then got stuck with 
it. For, if you liked mathematics just a little bit, Maestro Bogomolny made it impossible 
for you to not love it. Mathematics was turned into a frolic. 

I discovered his riddles on social media. (Alexander B. does not like the word 
“problems”. I now understand why.) 

Social media brings out the best and the worst in people. He was rigorous yet 
open-minded, allowing people like me who did some mathematical economics to cheat 
with inequalities by using the various canned methods for finding minima and maxima. 
He even tolerated computerized mathematics, provided of course there was some rigor 
in the process. I initially knew nothing about him but could observe rare attributes: 
an extraordinary amount of patience and a remarkable sense of humor. One summer, 
as he was in Israel, I informed him that I was vacationing in Lebanon. His answer: 
“Walking distance”. He always had a short comment that makes you smile, not laugh, 
which is a social art. 

Alexander B. created a vibrant community around his Twitter account. He would 
pose a question, collect answers and patiently explain to people where they were wrong. 

I, for myself, started almost every day with a puzzle, with the excitement of unpre- 
dictability, as it took from 5 minutes to 4 hours to complete—and it was usually 
impossible to tell from the outset. For a couple of years, it was the first thing I looked 
at with the morning coffee. There was some mild competition, mild enough to be 
entertaining but not too intense to resemble an academic rat race. Once someone got a 
proof, we had to look for another approach so it paid to wake up early and beat those 
with a time zone advantage. 
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In the two years since he left us there has been no Saturday morning—104 of 
them—that I did not solve a riddle randomly selected on the web in his memory. But, 
without him, it is not the same. 


How did Alexander Bogomolny get there? 

I met him in an Italian restaurant in New Jersey. I was surprised to see a math- 
ematician who looked much more like a maturing actor than someone in a technical 
specialty: tall, athletic, jovial, and with a charismatic presence. But, as he had warned 
me, he had a severe hearing problem, the result of a medical treatment for the flu. 

This explained to me his veering away from an academic career to get involved in 
computer pedagogy. His hearing was worsening with time. It is hard to imagine being 
a professor with reduced auditory function in one ear (in spite of a hearing aid) and 
none in the other. 

There was something fresh and entertaining about him. He was happy. One could 
talk and laugh with him without much communication. 

He was neither interested in money nor rank—something refreshing as I was only 
exposed to academics who, whether they admit it or not, are obsessed with both. 
When I asked him about commercializing his website cut-the-knot his answer was “I 
have two pensions. Next year I turn seventy”. He wasn’t interested in poisoning his life 
for more money. 

Why did I start nicknaming him Maestro? Because it was pretty much literal: he 
played math like a master would with a musical instrument—and mostly to himself. 
He was physically bothered by a sloppy derivation or an error, as if he heard a jarring 
note in the middle of a sonata. It was a joy to see someone so much in sync with his 
subject matter—and totally uncorrupted by the academic system. 


Now, probability. In one conversation, I mentioned to him that probability riddles 
would be very useful for people who want to get into the most scientifically applicable 
scientific subject in the world (my very, very biased opinion). What I said earlier 
about play is even more applicable to probability, a field that really started with 
gamblers, used by traders and adventurers, and perfected by finance and insurance 
mathematicians. Probability applies to all empirical fields: gambling, finance, medicine, 
engineering, social science, risk, linguistics, genetics, car accidents. Let’s play with it 
by adding to his feed some probability riddles. 

His eyes lit up. Hence this book. 


I thank Marcos Carreira, Amit Itagi, Mike Lawler, Salil Mehta, and numerous 
others who supported us in this project. 

And a special gratitude to Stephen Wolfram and Jeremy Sykes for ensuring that 
Cut-the-Knot stays alive and that this book sees the day. Additional thanks to 
Paige Bremner, Glenn Scholebo, and other members of Wolfram Media for editing 
the manuscript. 


Preface 


On a superficial view, we may seem to differ very widely from each other 
in our reasoning, and no less in our pleasures: but notwithstanding this 
difference, which I think to be rather apparent than real, it is probable that 
the standard both of reason and Taste is the same in all human creatures. 


E. Burke, 1757, A Philosophical Inquiry into the Origin of Our Ideas of 
the Sublime and Beautiful 


The American Heritage Dictionary defines probability theory as the branch of math- 
ematics that studies the likelihood of occurrence of random events in order to predict 
the behavior of defined systems. (Of course “what is random?” is a question that is not 
all that simple to answer.) 

Starting with this definition, it would (probably) be right to conclude that proba- 
bility theory, being a branch of mathematics, is an exact, deductive science that studies 
uncertain quantities related to random events. This might seem to be a strange mar- 
riage of mathematical certainty and the uncertainty of randomness. 

This collection of probability problems is a collaborative effort born on the web 
and tested in social networks. Even though meeting over the wires is a far cry from 
a face-to-face encounter, it has a clear positive side. Web acquaintances are drawn 
to each other due to mutual interests, mostly without seeking to learn or attaching 
importance to unrevealed personal details. 

Some of the problems are original but most have been gathered from a variety of 
sources: books, magazine articles and online resources. The collection is reasonably 
comprehensive, though its value is mostly in that many if not most of the problems 
come with multiple solutions. So much so that quite often it was difficult to assign 
problems to specific chapters. 

Receiving solutions from people, many of whom would confess to not being math- 
ematicians or having forgotten all the math they studied in school or college, was an 
exhilarating experience. One of the ideas we pursued in writing this book was to share 
that experience with a broader audience. 

We prefer the term riddle to problem. 


Chapter 1 


Intuitive Probability 


One sees in this essay that the theory of probabilities is basically only 
common sense reduced to a calculus. It makes one estimate accurately 
what right-minded people feel by a sort of instinct, often without being 
able to give a reason for it. 


Laplace, 1814, Philosophical Essay on Probabilities 


The theory of probability supplies a good deal of counterintuitive results (see Chap- 
ter |3| page (15). However, the theory of probability arose from practical applications 
and is, in essence, a formal encapsulation of the intuitive view of chance. 

This chapter collects a few probability problems whose solutions are based on intu- 
ition and common sense and do not require any theoretical knowledge. 

Be warned, though, that even great mathematical minds sometimes get beguiled 
by intuition. For example, when asked for the probability that heads appears at least 


2 
once in two throws of a fair coin, Jean le Rond d’Alembert (1717-1783), replied 3 (the 


correct answer is D because in his opinion there was no reason to continue tossing the 


coin after heads showed up on the first toss. Thus, he judged events H, TH, TT as 
equiprobable. 


The bracketed items under the section and subsection heads are citations 
to items listed in the bibliography. The page numbers refer to the exact 
portion of the cited item. Some riddles and solutions were created for the 
Cut The Knot project and are not published elsewhere. 


Riddles 


1.1 Bridge Hands 

p. 39] 

A game of bridge is played by two pairs of players, with players on a team sitting 
opposite each other. Which situation is more likely after four bridge hands have been 
dealt: you and your partner hold all the clubs or you and your partner have no clubs? 


1.2 Secretary’s Problem 
p. 39] 
A secretary types four letters to four people and addresses the four envelopes. If the 


letters are inserted at random, each in a different envelope, what is the probability 
that exactly three letters will go into the right envelope? 
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1.3 Pencil Logo 
102] p. 1] 


A pencil with pentagonal cross-section has a maker’s logo imprinted on one of its five 
faces. If the pencil is rolled on the table, what is probability that it stops with the logo 
facing up? 


1.4 Balls of Two Colors | 
pp. 1-2], Problem 319] 


A box contains p white balls and q black balls, and beside the box lies a large pile of 
black balls. Two balls are drawn at random (with equal likelihood) out of the box. If 
they are of the same color, a black ball from the pile is put into the box; otherwise, 
the white ball is put back into the box. The procedure is repeated until the last two 
balls are removed from the box and one last ball is put in. 

What is the probability that this last ball is white? 


1.5 Division by 396 
Problem 56] 
Find the probability that if the digits 0, 1, 2, ..., 9 are placed in random order in 


the blank spaces of 5_383_8_ 2 936 5 8 203 9 3_76 the resulting number will 
be divisible by 396. 


1.6 Fair Dice 
Michel Paul 


Toss two dice. What is the probability of having 2 or 5 on at least one die? 


1.7 Selecting a Medicine 
pp. 97-98] 


Half the people who contracted a certain disease which is spreading across the country 
have died and half got better on their own. 

Two medicines, A and B, have been developed but not actually tested. A was 
administered to three patients and all survived. B was administered to eight patients 
and seven survived. 

In the unfortunate event of your contracting the disease, which medicine would you 
choose? 


1.8 Probability of Random Lines Crossing 
Problem 8] 


An urn contains the numbers 1, 2, 3,...,2018. We randomly draw, without replace- 
ment, four numbers in order from the urn which we will denote a, b, c, d. What is the 
probability that the following system will have a solution strictly inside (i.e., not on 
the axes) the first quadrant? 


ax+by = ab 
cz +dy = cd. 
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1.9 Six Numbers, One Inequality 
A. Bogomolny. Inspired by Donald Knuth’s puzzles in [z5]. 


Six distinct digits, 1, 2, 3, 4, 5, 6, are placed randomly into the boxes below. What is 
the probability that the indicated inequality holds between two digits adjacent to the 
inequality symbol? 


| 
E 
| 


Answer the same question concerning another placement of the inequality sign: 


EES 


1.10 Metamorphosis of a Quadratic Function 
A.A. Berzin’sh Problem 383]. 
The problem was offered at the XVII All-Union Mathematical Olympiad for grade 8. 


A math teacher in a special math school wrote the quadratic polynomial x? + 10x + 20 
on the blackboard and asked the students in a class of 24 to modify either the free 
coefficient or the one by x, every time by +1. The polynomial x? + 20x + 10 was the 
result of the experiment. 

What is the probability that, along the way, there was a quadratic polynomial with 
integer roots? 


1.11 Playing with Balls of Two Colors 
p. 2.16], Challenge Problem 10], pp. 79-80] 


An urn contains balls of two colors—black and white—at least one of each. You draw 
balls at random. In a round of ball drawings, you draw a ball, discharge it but note 
its color. You continue drawing and discharging the balls as long as they are of the 
same color. The first ball of a different color is placed back into the urn. This ends the 
round. The next and all successive rounds are played until the urn becomes empty. 
You win if the last ball to be drawn is black. 

How many balls of each color in the urn should you start with to maximize your 
probability of winning? 


1.12 Linus Pauling’s Argument 


On the occasion of his receiving a second Nobel prize, Dr. Linus Pauling, the chemist, 
remarked that, while the chances of any person in the world receiving his first Nobel 
prize were one in several billion (the population of the world), the chances of receiving 
the second Nobel prize were one in several hundred (the total number of living people 
who had received the prize in the past), and that it was therefore less remarkable to 
receive one’s second prize than one’s first. 

Is there a flaw in Pauling’s argument? 
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1.13 Galton’s Paradox 
p. 2.1.10 


This question from Francis Galton (1894) concerns the chance of three (fair) coins 
turning up alike (all heads or else all tails). 

At least two of the coins must turn up alike, and, as it is an even chance whether 
a third coin is heads or tails, the chance of (all three) being alike is 1 to 2. 

Where does the fallacy lie? 
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Solutions 


Bridge Hands 


If my partner and I have no clubs then all the clubs are in the hands of our competition. 
If we do have all the clubs, they have none of the clubs. Both pairs of players are equally 
likely to get all the clubs or to get none of them. 

It follows that the events of us getting all the clubs or getting none of them are 
equally likely. 


Secretary’s Problem 


Since there were just four letters and four envelopes, the fact that some three letters 
were placed into the right envelopes implies that the remaining letter has also been 
placed into the right envelope. Therefore, the probability is 0 that exactly three letters 
went into the right envelopes. 

A similar problem was published in under the caption The Careless Mail- 
ing Clerk: 

After a typist had written 10 letters and addressed the 10 corresponding envelopes, 
a careless mailing clerk inserted the letters in the envelopes at random, one letter per 
envelope. What is the probability that exactly nine letters were inserted in the proper 
envelopes? 

The solution is left as an exercise. 


Pencil Logo 


It is impossible for the pencil to stop on an edge as to make a face—with or without 
the maker’s logo—show on top. The probability of the event in question is 0. 


Balls of Two Colors | 


The key to solving the riddle is to notice that the parity of the number of white balls 
never changes. Thus, the last ball will be white if at the outset the number of white 
balls was odd; if it was even, the last ball will be black. 

Thus, the answer is 1 if the box originally contained an odd number of white balls 
and 0 otherwise. 


Notes 


Over time, the riddle has appeared in several disguises. For example, offers a game 
with circles and squares where, on each step, two different figures are replaced by a 
square and two equal ones, by a circle. 


Division by 396 


The riddle is obviously contrived. The last two digits of the longish number are 76, 
implying that the number is divisible by 4 and “ = 99 = 9 x 11. We need to check 
that, however the 10 digits are distributed over the empty spaces, the resulting number 
is divisible by 9 and also by 11. 
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A number is divisible by 9 if and only if the sum of its digits is divisible by 9. The 
sum of the missing digits is 


0+1+4+2+---4+9=45, 
which is divisible by 9. The sum of the present digits is 


5+3+8+3+8+2+9+3+6+5 
+8+2+0+3+9+3+7+6=90 


which is also divisible by 9. It follows that the resulting number is divisible by 9 
regardless of the placement of the missing digits. 

To check whether a number is divisible by 11 we compute two sums: that of the 
evenly placed digits and that of the oddly placed digits. All the missing digits come 
together along with 8, 3, 0 and 6, which add up to 45 + 17 = 62. The sum of the 
remaining digits is the 90 — 17 = 73. The difference of the two sums is 73 — 62 = 11 
which is divisible by 11, and we are finished. 


Fair Dice 


For a single die, all six outcomes are equally likely. So, on average, one gets 2 or 5 in 
2 1 
two cases out of six, making the probability P(2 or 5) = = = 3 This is less than the 


6 
fair half. 
For two dice, the situation is different and may be represented graphically: 


1 2 3 4 5 6 


There are 6 x 6 = 36 equiprobable squares/events. The event “2 or 5” occurs in 20 
20 


cases, making the probability P(2 or 5) = ag ° which is more than fair. 


Selecting a Medicine 


From the available data we may assume that, without any medicine, your chances of 
survival are 50:50. We shall never know whether either medicines does more good than 
harm. What we can estimate and compare are the chances of survival with and without 
taking any medicine. 

There were three patients treated with A. Were they not, the three of them would 


1 


1\3 
survive with the probability of (5) = 53° 


1 1 
The probability of having seven survivors out of eight is 8 - z which is ark 
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Thus, the probability of having seven survivors out of eight is four times smaller 
than that of having three survivors out of three. There is a good chance that medicine 
B proved to be more useful than medicine A. 


Probability of Random Lines Crossing 


The system is best rewritten as 


= J 


+ 


x 
b 
x 
d 


In this form it is obvious that we deal with two straight lines: the first intersecting 
x and y axes at (b,0) and (0,a), respectively, and the second intersecting the axes at 
points (d,0) and (0,c), respectively. From a geometric view, the system never has a 
solution on the axes because the four numbers are distinct, although random. Thus, 
the extra condition is a harmless red herring [19]. 

The two lines intersect within the first quadrant when (a — c)(b — d) < 0. This 
condition is as probable as its complement (a — c)(b — d) > 0. Thus, both have the 


1 
probability of 5° 


Six Numbers, One Inequality 


1 
The answer for both questions is 3 and this can be obtained without any calculations. 


Indeed, any placement of the digits that satisfies the inequality is paired with another 
one that does not. The two are obtained from each other by swapping the two digits 
adjacent to the inequality sign. Nicu Mihalache was the first to suggest that approach 
on Twitter. 

We can do the counting as well. The sample space consists of 6! elements. There 


are (5) ways to select a pair adjacent to the inequality symbol and only one way to 
place them so the inequality is satisfied. The remaining four numbers can be placed 
in 4! ways, giving the total number of “successful” combinations as (5) - 4! and the 
corresponding probability as 


6! 2 


Metamorphosis of a Quadratic Function 


Consider the polynomial fa (£) = x? +ar+ and its value at x = —1: fa»(—1) = b-a. 
Every time a or b changes by +1 so does the expression b — a. Now fio,20(—1) = 11 
whereas f20,10(—1) = —9. Thus, it follows that in the process of getting one from the 
other there ought to be a stage with fa»(—1) = 0, meaning that the integer —1 is 
the root of a provisional polynomial. The other root is naturally also an integer. The 
probability of this happening is 1. 
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Playing with Balls of Two Colors 


Solution 1 

Imagine that prior to every round, the remaining balls are lined up in a random order 
and you simply pick the balls from left to right. A round corresponds to a sequence 
of balls of the same color. The balls before the last round are all of one color; the 
balls before the penultimate round are in two groups of different colors, say black balls 
on the left, followed by white balls on the right. It could be the other way around. 
Regardless of the size of each group, these two orders are equally likely. But the order 
of the groups does impact the result: in one case, the last ball will be white, in the 


1 
other it will be black. Thus, the probability of either is rs; 


Solution 2 
An excerpt from “A Sampling Process,” B.E. Oakley and R.L. Perry, The Mathematical 
Gazette, February 1965, pp. 42—44. 


Let an urn contain m > 0 black balls and n > 0 white balls. Let umn(b) be the 
probability that the sequence of balls being discarded should begin and end with a 
black ball. Let umn(w) be the probability that it should begin with a white ball and 
end with a black ball. We require umn = Umn(b) + Umn(w). We have 


ins 


(unint J tm (1.1) 


m+n m+n-—1 


Equation [1.1] also holds if m = 1 and n > 1 since both sides are zero. 
Similarly, for m > 1, n > 2, 


m m 
mn = m,n— m,n— 1.2 
tmnt) = i (ummat) + ma) (1.2) 
a 1 . ; 
and again, since Umni(w) = mat Umo(w) = 0 and umo = 1, equation |1.2| holds if 
m 


m>1,n=1. 


Solution 3 
This is like Riddle[7.8] The Lost Boarding Pass on page 


Balls come in runs of the same color. The first passenger gets in somebody’s seat, some 
after him get in their seats until comes a fellow in whose seat sits the first passenger. 


1 
The round starts anew. The last fellow to enter the plane has the probability of 3 to 


get into his own seat. This probability is independent of how many people sit in their 
assigned seats and how many are in somebody else’s. 

The analogy is in the sequence of runs/rounds. The passenger runs can be looked 
at as the balls of the same color. 


Example 1 
Joshua B. Miller 


Consider the case of one white (W) ball and two black (B) balls. You have four possible 
sequences in which the balls are drawn from the urn (a dash designates an interruption 
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9 


due to a color change so that the ball before a dash is put back into the urn): 


Action Result | Probability 
WB — BB win ; 
BW-WB-B | win I 
BW — BW — W | loss : 
BBW -W loss 
Example 2 
Joshua B. Miller 
Two white balls and one black ball: 
Action Result | Probability 
WB- BW —W | loss ; 
WB-WB-B | win I 
BW -WW loss ; 
WWB-B win ; 
Example 3 
Joshua B. Miller 
Three white balls and one black ball: 
Action Result | Probability 
WWWB-B win : 
WWB-WB-B win ; 
WWB —- BW -W loss ; 
WB-WWB-B win 5 
WB-WB-WB-B| win ai 
WB-WB—- BW -W | loss = 
WB — BW - WW loss 5 
BW —- WWW loss ; 
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Solution 4 
Joshua B. Miller, Thamizh Kudimagan 


The penultimate round will begin with an urn with composition of black and white 
balls. For each of these urn compositions (not equally likely as penultimate rounds), 
there are just two ways to have a penultimate round draw, and they are equally likely 
from that urn. The two ways will have different pay offs. 


Solution 5 
Alejandro Rodriguez 


Let P(B,W) be the probability of winning given that the urn contains B black balls 
and W white balls. We can compute P(B, W) as follows: 


B 
P(B,W) = V(B — 1, W, black) + 


=z v V(B,W — 1, white) 


B+W 
where V(B,W, x) is the probability of winning given that there are B black balls and 
W white balls remaining and the last ball drawn is of color x. V(B,W,x) can be 
calculated as: 


B 
V(B,W, black) = p7 yy” (B — 1, W; black) + 5 W PB, w) 


+W 
W 


B 


=z W V(B,W — 1, white) 


Boundary conditions are given by: 
V(B,0,x)= P(B,0)=1 
V(0,W, x)= P(0,W) =0. 


Plug the formula into a spreadsheet and get P(B, W) = 0.5 for B and W greater 
than zero and less than 30 (my guess is that it is always the case). 


Notes 


This is a problem from P. Winkler and also from M. Gardner [45]. 

Gardner announces the result and refers to the paper “A Sampling Process” by 
B.E. Oakley and R.L. Perry in The Mathematical Gazette (February 1965, pp. 42-44), 
see Solution 2 on page |8| Winkler refers to Gardner and gives a solution by Sergiu 
Hart from the Hebrew University of Jerusalem. Solution 1 is a paraphrase of that in 
101). 

Winkler mentions Hart’s observation that the game is isomorphic, in a sense, to 
Riddle [7.8] The Lost Boarding Pass on page[158] This was discussed in the Analogous 
to Other Puzzles section. 

P. Nahin offers that as Challenge Problem 10 in where he presents computer 


1 
simulation that confirms the probability of z Like Gardner and Winkler, he refers to 


the long proof by Oakley and Perry. 
Joshua B. Miller was instrumental in shaping the present section. 
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Linus Pauling’s Argument 


The chance of receiving a second Nobel prize only depends on the total number of living 
Nobel prize winners if you know for a fact that the Nobel prize committee have decided 
to honor, again, one of the selected group. However, Nobel prizes are awarded quite 
independently of past awards, and so the actual chance of being awarded two Nobel 
prizes is one chance in a billion times a billion (if you suppose, which is implausible, 
that every human being on earth has an equal chance of winning the award). 

Linus Pauling’s Nobel prizes, incidentally, were the prize for Chemistry in 1954 and 
the Peace Prize in 1962. 


Galton’s Paradox 


First, this is a fallacy. An experiment with three fair coins has eight equiprobable 


1 
outcomes, so each comes up with the probability of z Two of them—H HH or TTT— 


1 1 1 1 
combine into an event of the probability 3 + g = 2 not 5° Thus the question: Where 


does the fallacy lie? 
The argument is valid to a point. There are indeed always two coins that come up 
the same way: both heads up or both tails up. (Pigeonhole Principle, .) It is also true 


1 
that the remaining coin comes up either way with the probability of z The conclusion 


is, however, false because there is no certainty of which two coins get involved in the 
first part of the argument. If the two coins were fixed, say, #1 and #2, then certainly 


1 
the third one—73—could come up either way with the probability of 7 But, in that 


1 1 1 
case, #1 and #2 may come up alike with the probability of 2 - 3373 All four 
possible outcomes are equiprobable: HH, HT, TH, TT. Thus, there is no certainty 
that #1 and #2 will come up alike. 


Among eight equiprobable outcomes of three tosses of a coin, exactly two come up 


3 
alike with the probability of T but there is no knowing which pair it is: 


HHH, HHT, HTH, HTT, THH, THT, TTH,TTT. 


Chapter 2 


What Is Probability? 


“Probability is the bane of the age,” said Moreland, now warming up. 
“Every Tom, Dick, and Harry thinks he knows what is probable. The fact 
is most people have not the smallest idea what is going on round them. 
Their conclusions about life are based on utterly irrelevant—and usually 
inaccurate—premises.” 


Anthony Powell, 1960, Casanova’s Chinese Restaurant 


Intuitively, the mathematical theory of probability deals with patterns that occur in 
random events. For the theory of probability, the nature of randomness is inessential. 
(Note for the record, that according to the eighteenth century French mathematician 
Marquis de Laplace, randomness is a perceived phenomenon explained by human igno- 
rance, while late twentieth century mathematics came with a realization that chaos may 
emerge as the result of deterministic processes.) An experiment is a process—natural 
or set up deliberately—that has an observable outcome. In the deliberate setting, the 
words experiment and trial are synonymous. An experiment has a random outcome if 
the result of the experiment cannot be predicted with absolute certainty. An event is a 
collection of possible outcomes of an experiment. An event is said to occur as a result 
of an experiment if it contains the actual outcome of that experiment. Individual out- 
comes comprising an event are said to be favorable to that event. Events are assigned 
a measure of certainty which is called probability (of an event). 

Quite often the word experiment describes an experimental setup, while the word 
trial applies to actually executing the experiment and obtaining an outcome. 

A formal theory of probability was developed in the 1930s by the Russian mathe- 
matician A.N. Kolmogorov. 

The two fundamental notions of probability theory are sample space and equiprob- 
able events. 

A sample space is where the events take place. The nature of a sample space depends 
on the kind of experiment under consideration. 

When tossing a coin, two outcomes are possible: heads and tails. The sample space 
for a single toss of a coin is {H,T}. Two events are possible and, provided the coin is 
fair, these two events are equiprobable. If P(X) denotes the probability of event X, 
then, for an experiment with a fair coin, P(H) = P(T). On the other hand, a coin 
always comes to rest with one of the sides up. We interpret that with the formula 
P(H or T) = 1. 


1 is the probability of certain truth. 


0 is the probability of certain falsity. 


It follows that, since events H and T are equiprobable and cannot happen simul- 


taneously then P(H) = P(T) = =. 
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Similarly, when tossing a fair die, the sample space consists of six equiprobable 
1 
events {1,2,3,4,5,6} such that each of them comes with the probability of z 


In addition to sample space and equiprobability, elementary probability theory 
employs a few basic notions—definitions and theorems. These are discussed at greater 
length in the appendices. 


Chapter 3 


Likely Surprises 


It is remarkable that a science which began with the consideration of 
games of chance should have become the most important object of 
human knowledge. 


Pierre Simon Laplace, 1812, Théorie Analytique des Probabilités 


Riddles 


3.1 Balls of Two Colors II 
[36], 


An urn contains w white balls and b black balls, w > 0 and b > 0. The balls are 
thoroughly mixed and two are drawn, one after the other, without replacement. Let 
W; and B; denote the respective outcomes “white on the it draw” and “black on the 
it draw,” for i = 1,2. 


Prove that P(W2) = P(W1) = a (Which clearly implies a similar identity for 


wtb 
Bı and Bə). 
Furthermore, P(W;) = — for any i not exceeding the total number of white 
w 
balls w. 


3.2 Coin Tossing Contest 
Problem 99], 


A tosses a fair coin 20 times; B tosses the coin 21 times. 
What is the probability that 6 will have more heads than A? 


3.3 Probability of Visiting Grandparents 
Mathematical folklore 


A boy, who lives in Moscow, visits his grandparents every Thursday. Both his maternal 
and paternal grandparents are alive and live at the two ends of the same metro line. 
The boy lives close to a metro station in between on the same line. When he gets down 
to the platform, he takes the first train that arrives, regardless of the direction. 

Short of “things happen,” what could be a rational explanation of the fact that 
after a (long) period of time he observes that he has made twice as many visits to one 
pair of grandparents than to the other? 

The trains go in both directions at the same fixed interval, and the boy gets down 
to the station at about the same time every Thursday. The trains are so scheduled 
that they never arrive at the same time from both directions. 
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3.4 Probability of Increasing Sequence 


Problem 1] 


Slips of paper with the numbers from 1 to 99 are placed in a hat. Five numbers are 
randomly drawn out of the hat one at a time (without replacement). What is the 
probability that the numbers are chosen in increasing order? 

What is the probability that the first number drawn is a 1? 


3.5 Probability of Two Integers Being Coprime 


What is the probability that two randomly selected integers are coprime, i.e., have no 
common prime factors? 


3.6 Overlapping Random Intervals 


Problem 102], 


2n > 4 points are chosen uniformly randomly on the interval [0, 1], say, X1, X9,..., Xan 
for 1 < j < n, where J; is the closed interval with endpoints X2;_; and X2;. Find the 
probability that one of the intervals J; has overlaps with all other intervals. 


3.7 Random Clock Hands 
Mathematical folklore 


A clock face is split randomly into alternating red and pink sectors. This question is 
about the relative probabilities of each of the clock hands being in the red 


e when the watch is functioning properly. 


e when the watch is broken and the hands rotate at a fixed angle (and, of course, 
at a fixed rate). 


e when the hands jump uniformly randomly all over the dial. 


3.8 Crossing River after Heavy Rain 
Marcus Moore] 


A traveler approaches a river spanned by bridges that connect its shores and islands. 
There has been a great storm the night before, and each of the bridges was as likely 
as not to be washed away. How probable is it that the traveler can still cross? 
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River 
— 


3.9 The Most Likely Position 
Problem 6] 


A standard deck has 52 cards, of which four are aces. When the deck is shuffled, what 
is the most likely position of the first ace? 


3.10 Coin Tossing Surprises | 
p. 303] 


A fair coin is tossed repeatedly. 


1. What is the expected number of tosses before HT shows up for the first time? 
2. What is the expected number of tosses before TT shows up for the first time? 
3. What is the probability that HT shows up before TT? 


4. What is the probability that HT shows up before H H? 
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Solutions 


Balls of Two Colors II 


Solution 1 


B3, (7 


We must remember the formula for the total probability: 
P(W2) = P(W2|W1) - P(W1) + P(W2|B1)- P(B1), 


from which 


w—1 w w b 
P > . 
(Wa) b+w-1 b+w b+w-1 b+w 
w:-(w—1+b) 


(b+w—1)-(b+w)’ 


which is simplified to 2; 
w+b 


The wonderful thing here is that the initial probability ia 5 is being maintained 
w 
even though the balls are not returned to the urn. 
Solution 2 
N.N. Taleb 
e Ea EE T 
ie ETT aa 
(w= 1) 
P(W2) = P(B P(W; 
(W2) Fie 1 ew Zi 
2 
w w 
-2 ae) 1- — 
(=a (F) p u Ww ( Ez) 
7 b+w-—2 (b+ w—2)(b+w) 


oiler a 


(b+ w- 2)(b + w) j b+w-2 


— W 
~ b+w 


More generally, 


Solution 3 
Joshua B. Miller 


If you empty the urn one ball at a time you will generate a sequence of length n = w+b. 
By exchangeability, each sequence is equally likely. What is the probability that, on 
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the ¿tł draw, you draw w? It is the number of unique sequences with w in the i*? 


(2 Beare 


a n! 


Sew 


n—1 n 
position, ( Ae divided by the number of unique sequences ( ), i.e., 
w 


wtb 


Coin Tossing Contest 


Solution 1 
Amit Itagi 


We shall consider several events concerning the possible total outcomes: 
e Hgs,~—the event of B having more heads than A, 
e Hp<a—the event of B having at most as few heads as A, 
e Tgs4—the event of B having more tails than A. 


We are interested in the probability of the first of these events. Observe that 
Hg<a = Tg>a. On the other hand, when it comes to probabilities, by symmetry, 


P(Hp>a) a P(Tp>a). 
If X is the total number of outcomes, then 


X = Hg>4 U Hg<4 and Hg>4 N Hg<a = 9. 
Finally then, 
1 = P(Hg>4)+ P(Hg<4) 


= P(Hpsa) T P(Tps a) 


i ; 1 
implying P(Hp>a) = 5- 


Solution 2 
Long Huynh Huu 


Choose A from 1,...,20, following a symmetric distribution. Choose B from 0.5, 
1.5,...,20.5, similarly. By symmetry, P(A < B) = P(A > B) = 0.5. 

(By “symmetric” I mean A— 10.5 and B — 10.5 have same distribution as 10.5 — A 
and 10.5 — B.) 
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Solution 3 
Faryad D. Sahneh 


There are two possibilities for getting more heads in experiment B with 21 toss trials 
than experiment A with 20 toss trials. 


e Event 1: The number of heads in the first 20 trials of B is already more than 
that of A. 


e Event 2: The number of heads in the first 20 trials of B is exactly equal to that 
of A, and trial 21 is a head. 


Let p = P(number of heads in the first 20 trials of B is equal to that of A). Then, 


1 1 
P(Event 1) = -(1 — p) and P(Event 2) = p x —. 
2 2 


Therefore, 


P(number heads in experiment B > number of heads in experiment A) 


1 
= P(Event 1) + P(Event 2) = 5° 


Solution 4 


If, in general A tosses a fair coin n times and B tosses the coin n + 1 times, then, due 
to symmetry, the probability of B having more heads than A is the same as him/her 
having more tails and this with equal probabilities. B must either have more heads or 


1 
more tails than A. Hence, both probabilities are 5° 


Notes 


Interestingly, even when both players toss a fair coin an equal number of times, say 
(m = n = 20), there’s a 40% chance that one of them ends up with more heads than 
the other. Equal distributions do not collapse to zero. 

Following is the result of a simulation where both players have the same number 
(m = n = 20) of tosses, but we count the difference of heads between them. We plot 
the probability as a function of the difference of heads: 


06 
054 
044 
034 
024 e 


0.1 4 


0o} s. a’ 


We note that, given two players toss a fair coin the same number of times, there’s 
a 44% chance one of them has at least one more heads than the other, so a fairly high 
chance of a 1 heads lead. 
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As we insist on a lead of more than 1 heads, the probability drops off almost linearly. 
Chances of a 2 heads lead are about 30%, 4 heads lead about 15%, 6 heads about 5% 
and practically zero chance of an 8 heads lead. 


Probability of Visiting Grandparents 


Solution 1 


The schedules of the trains in the two directions are shifted relative to each other: 
Le ene See eee eee see eee eee pees ee ee ee ce ee eee eel 
Aa 


If, e.g., on the red line, the trains arrive at 3, 6, 9, etc. minutes to an hour, while 
on the blue line that’s 1, 4, 7, 10 minutes, etc., then the chances of the boy catching 
the red direction are 2 to 1. 


Solution 2 
Amit Itagi 


The probability distribution function of his arrival is such that the ratio of the areas 
under the probability distribution function for time intervals corresponding to the two 
directions is ~2. 


Solution 3 
Elia Noris 


im. 


Probability of Increasing Sequence 


Solution to Question 1 

It does not at all matter whether there are 99 slips of paper or just five. The best way 
to approach the problem is to consider drawing just two numbers. So you’ve drawn two 
distinct numbers: what is the probability that the first one will be less than the second? 
Well, there are just two ways to draw two chosen numbers: the smallest either comes 
first or last, meaning that the probability of drawing two numbers in their natural 


1 
der is =. 
order is 5 


When it comes to five numbers. There are 120 permutations of five objects and 
thus 120 ways to draw a given set of five numbers. Only in one permutation out of 120 
all five numbers follow in their natural order of magnitude. 


Solution 1 to Question 2 
Josh Jordan 


# of ways to choose four other numbers when one has already been chosen 


G) s 


E ~ 99° 


# of ways to choose five numbers 
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Solution 2 to Question 2 
Michael Weiner 


Let A be the event that a 1 is drawn. Let B be the event that the numbers come out in 
increasing order. I asked for P(A|B) but A and B are independent, so P(A|B) = P(A). 


5 
Five numbers are drawn; the probability that one of them is 1 is 99° 


Probability of Two Integers Being Coprime 


Notes 


A short solution of this statement by Aaron Abrams and Matteo Paris |1| in the College 
Mathematics Journal was introduced by a letter from Henry L. Adler where he tells 
the story of the proof. He found the question “natural to ask” so it was no surprise 
when students in his introductory number theory class indeed raised the question. He 
answered it with a “traditional proof” that required “considerable prior knowledge of 
number theory,” but then the two authors (16- and 17-year-olds at the time) came up 
with a short and intuitively clear proof. He then encouraged the boys to share their 
proof with the readers of the College Mathematics Journal “who might be asked the 
same question in their classes.” 

A few lines of the proof made a reference to pp. 202-204] for the justification 
of the concepts involved (the formalism of selecting a random integer). Yagloms’s 
book also poses the question at hand and provides a more or less elementary but 
very detailed solution. Yagloms refers to the question as “Chebyshev’s problem” and 
outlines the contribution of Pafnuty L. Chebyshev in a short footnote, although there 
is no indication that the solution in the book originated with the famous Russian 
mathematician. It is rather straightforward and does not require “considerable prior 
knowledge of number theory.” 

One that does can be found in the classic, An Introduction to the Theory of Numbers 
by Hardy and Wright [49]. Without doubt, Chebyshev could have authored either of 
the two proofs, but Hardy and Wright do not mention his name either as the question 
poser or a solver. 

In any event, I find it amusing that this “natural to ask question” might have been 
first asked by a mathematician of Chebyshev’s stature. 

Following are all three proofs in this order: Yagloms’s, Abrams and Paris’s and 
then Hardy and Wright’s; the latter will be extended shortly with relevant information 
from number theory. A simple intuitive proof precedes the three. 

Aaron Abrams and Matteo Paris’s proof can be found in pp. 230-233], where 
the authors also refer to Yagloms’s book but do not mention Chebyshev’s name, the 
boys’ ages, or Adler’s story. 


Solution 1, a la Euler 


The common convention for selecting a random integer is selecting one uniformly 
randomly from the set {1,2,...,n} and then letting n > oo. 

For a prime p, every p“ integer is divisible by p. Assuming that divisiblity by p for 
two random numbers constitute independent events, the probability that two numbers 


1 
are not divisible by p is — so that the probability that at least one of them is divisible 
p 


1 
by pis L——. 
P 
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Further, divisibility of a number by one prime is independent of its divisibility by 
another. Thus, rather informally, the sought probability is 


n (-#)-4 


p is prime 


as proved by L. Euler in 1735. For a more rigorous proof, see Theorem 332]. 


Solution 2, Straightforward 


If two numbers are coprime, then, in particular, no prime is their common factor. So 
we start with a simpler problem: 


What is the probability that 2 is not a common factor of two random 
integers? 


Given two random numbers a and b, for each, the probability of being divisible 

1 1 1 1 

by 2 is 3 such that the probability of them both being divisible by 2 is z2 T 
1 
Therefore, the probability that 2 is not their common factor is 1 — 7 

Similarly, for any prime p, the probability that p does not divide both selected 
numbers is 1 — —. 

p? 

There is no basis to assume that divisibility by one prime number is in anyway 
related to divisibility by any other prime number. We therefore assume that events of 
divisibility by prime numbers (or their complements) are independent such that the 
probability that two numbers are not simultaneously divisible by any prime should be 
the product 


(-2)(-8)(-8)-- 16-9). 


where the product is taken over the whole sequence of prime numbers. This needs to 
be understood as the limit of finite products: 


1 ; 2 1 
I-s IL(1- 2). 


where {px} is the sequence of prime numbers pı = 2, p2 = 3, pa = 5 and so on. 
We have to check that the limit exists: 


k=1 Pk Pk Pk 
P ? : : 1 1 
because each fraction is the sum of the geometric series 1+ 57+ 4+ Gt... 
i Pk Pk Pk 
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But, if we multiply out, we get a surprising result: 


ice eee a ee eee eel 
pa. Pk Ph PR ae ee ae 


where the sum is over all integers whose prime factors do not exceed pp. As k grows, 
the gaps in the series are filled in so that at the limit 


ice ee ee ae ee oer 
' pz | pt © pb ° ee 92 32 42 5270 


where on the right, each integer appears exactly once due to the Fundamental Theorem 
of Arithmetic, i.e., the fact that every integer has a unique (up to the order of the 
factors) representation as the product of powers of distinct prime numbers. The sum 
on the right is known to converge and its sum, as was first found by L. Euler, equals 


from which it follows that the probability of two random integers being coprime is —. 
T 


Solution 3, Shortest Option 

Let q denote the sought probability of two random integers being coprime. Pick an 
1 

integer k and two random numbers a and b. The probability that k divides a is p and 

the same holds for b. Therefore, the probability that both a and b are divisible by k 


1 b 
equals gz The probability that a and b have no other factors, i.e., that gcd @ =) = 


b 
1, equals q by our initial assumption. But gcd € ) = 1 is equivalent to gcd(a, b) = k. 


k’k 
Assuming independence of the events, it follows that the probability that gcd(a, b) = k 
equals — -q = L 
qu k2 q a k2 A 


Now, k was just one possibility for the greatest common divisor of two random 
numbers. Any number could be the gcd(a, b). Furthermore, since the events gcd(a, b) 
are mutually exclusive (the gcd of two numbers is unique) and the total probability of 
having a gcd at all is 1, we are led to 


implying that 


—1 

“1 6 

q= bs a =<. 
k=1 
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Solution 4, Number-Theoretic 


Notations first: 
¢(n)—Euler’s totient function, the number of integers not greater than n and 
coprime to n: 


&(n) =} ln). 
k=1 


p(n)—the Mobius function: 


1 
p(n) = 0, if n has a squared factor; 
U(pip2...pr) =(—1)*, if all primes pp are different. 


Both functions ¢ and u are multiplicative, meaning that for coprime a and b, 


(ab) = $(a)9(b) and (ab) = u(a)u(b). 


The two functions are related in the following manner: 


o(n) =n wld) => ald) = oan (4) = YO aaa. 


d\n d|n d|n dd’=n 


u has an important property: 


gasfe tazi 


otherwise. 
d|n 


Finally, remember the “Big O” notation: to say that f = O(g) is equivalent to 
saying that |f| < Clg|, for C not dependent on g. 


mE dm dd'<n 
s [n/a] n Z 
=a ¢ =} a (E-B) 
d=1 d’=1 a= 


ll ll 
NI = NI = 
SEM 
Me or 
E S 
+ SIA 
O + 
2A TT: QO 
3 o—~ 
Me £ 
Mite 
al= 
Qe 


~ 2¢(2) 
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where ¢(s) is the famous ¢ function: ¢(s) = you. As mentioned above, it was 
n= 


2 


established by L. Euler that ¢(2) = ee 


Noteworthy is the fact that ®(n)+1 is the number of terms in the n'è Farey series 
which consists of an ordered list of irreducible fractions from the interval [0,1] 
whose denominators do not exceed n. 

Now, assume n is given and consider fractions ; that satisfy 


q>Oandl<q<p<l. 
Say &€, is the number of those fractions in lowest terms out of the total Yp = 


1 2 
mast) ~ F fractions. Thus, the probability that q and p are coprime is given by 


Overlapping Random Intervals 
Graham Brightwell] 


Instead of the unit interval, we drop 2n + 1 points on a circle. One, say X, is the 
starting point for traversing the circle. All other points are labeled in successive pairs 
a,b,.... By assumption, all cyclic orderings of 2n points are equally likely. Given such 
point placement, start with X and move clockwise until you meet one symbol, say a, 
for the second time. Turn around, and move counterclockwise until you meet another 
symbol, say b, twice. Necessarily, you have gone beyond X. Now look at the string of 
symbols between b and a at the extremes of this trajectory but running clockwise from 
the (second) b to the (second) a. This string includes X, the other a and the other b. 
Look at the order in which these three occur. There are just three possibilities: 


1. bXa, 
2. Xba, 
3. Xab. 


Claim 


In cases 2 and 3, the interval b intersects all others, whereas in case 1 there is no 
interval that intersects all others. 


Proof of the Claim 
In cases 2 and 3, if the b-interval does not intersect some c-interval, then the relevant 
symbols occur either as bccXb or bXccb, contradicting the choice of b in the first case 
and the choice of a in the second. 

On the other hand, in case 1, if a cinterval intersects both the b-interval and the 
a-interval, then both c symbols must lie in the string, contradicting the choice of b. 
This completes the proof. 
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Since all distributions are equally likely, each of the three cases has probability 


1 2 
of 7 meaning that the existence of an interval that meets all other intervals is 3? 


independent of n. 


Notes 


Justicz, Scheinerman and Winkler proved a more general result in [59]. 
For any k < n, the probability that in a family of n random intervals there are at 
least k which intersect all others is 


9k+1 


Ge 


which is independent of n. 


Random Clock Hands 


As hard as it may be to believe, the events of the two hands hitting red are independent 
even when they are rigidly attached at a certain angle. Imagine throwing the first hand 
randomly, attaching the second hand at a certain angle but only drawing the latter. 
Would you think that the second hand (which actually appears alone) is less random 
or has a different probability of hitting red than the lone first hand? 

To add a little more rigor to the above argument, let R be the set of red points on 
the unit circle. (Selecting a hand is equivalent to picking a point on the unit circle.) 
We may also think of it as a set R, in the interval [0, 27). Let a be the angle between 
the two hands. Selecting the second hand is equivalent to selecting a point in [0, 27) 
and then shifting it by a, i.e., selecting a random point in [0, 277) +a, which is the same 
[a, 27+a). The set R, undergoes a shift into Ro +a, which has the same relative size as 
the set Ro. Furthermore, the original set R is clearly left unchanged by the shift. What 
does change is the point at which the circle is cut before being unfolded into a segment. 


Crossing River after Heavy Rain 


The configuration of bridges has a remarkable symmetric structure, Figure 
Imagine that there is a boat floating down the stream. Any bridge that withstood 
the storm will block the passage of the boat. The boat will pass through if a sufficient 
number of bridges have been washed away. What is the probability that the boat will 
get through? 
The key observation that brings a solution to the problem is that it has the same 
abstract structure for the traveler and the boat, Figure 


River 
-i 


Figure 3.1: Crossing river after heavy rain. 
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Entry Exit 
Connector 
Entry Exit 


Figure 3.2: Crossing river after heavy rain: abstract structure. 


For the traveler, bridges 1 and 2 are the entries, whereas bridges 4 and 5 are exits. 
For the boat, the entries are at bridges 1 and 4 and the exits are at bridges 2 and 5. 
A bridge that has withstood the storm is good for the traveler but bad for the 
boat and vice versa. But each individual event has the probability of 50%. Having an 
identical abstract structure suggests (by the Principle of Symmetry, Appendix [B] on 
page |285) the following observation (writing P(X) for the probability of event X): 
1. P(traveler crosses) = P(boat gets through). 
But, beyond that, as we just observed, what is good for one is bad for the other 
such that 
2. P(boat gets through) = P(traveler does not cross). 


As a consequence of 1 and 2, we get 
3. P(traveler crosses) = P(traveler does not cross), 


with the inevitable conclusion that both probabilities evaluate to 50%. 


The Most Likely Position 


Notes 


The formulation of the problem seems to be open to interpretations. There are at 
least two: 


1. On many, many trials, what is the expected position of the first ace? 
2. For what position of the first ace is the probability the highest? 


The first of these is discussed elsewhere (Problem on page |194). The second 
question may appear obvious to experts, with an immediate answer. The answer is still 
surprising to nonexperts, so this interpretation will be solved: 


A standard deck has 52 cards, of which four are aces. When the deck is 
shuffled, for what position of the first ace is the probability the highest? 
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Solution 1 
@BKKeconcode on Twitter 


Let p(k) be the probability for the first ace to appear at position k. Then 


ee 
PT yg 
48 4 
P(2) = 55 By 
48 47 4 
p(3) = 5'51 50 
48 47 46 4 
-25S t 
52 51 50 49 
i= IE? Ag j 4 
P= Il 5223] 532k 
j=0 
Note that 
plk+1) (49-k 4 53K 
p(k) \58—-k 52-—k 4 
49 —k 
Z 1 
52- kS” 


implying p(k + 1) < p(k) so that p(1), however small in itself, is the largest of all the 
probabilities. 


Solution 2 
Amit Itagi 


If the first ace is at position n, the number of ways of placing the remaining three aces 


52 — 
is ( 3 | Having chosen these four positions, the number of permutations of cards 


is 4!48!. Thus, the probability of the first ace being at position n is 


K — ") 
52—n\ 4!48! 3 
P(n)=( 3 ) 52\ 
4 
52—n)\. ; ee 3 
( 3 ) is a decreasing function of n. Thus, the probability is maximized for n = 1. 


Coin Tossing Surprises | 
A. Bogomolny 


Notes 


Since all four possible outcomes of two coin tosses are equiprobable, a couple of facts 
may seem surprising and, as such, have been discussed in a number of publications, 
e.g., by Martin Gardner in his The Colossal Book of Mathematics [44]. 
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First, as the transition diagram (Markov chain) shows, except for an appearance 
at the first two tosses, TT may only show up after HT, Figure [3.3] 


Figure 3.3: Coin tossing: whole picture. 


So perhaps the diagram makes it less surprising that the expected number of tosses 
to the first appearance of HT is less than that for TT. This phenomenon will acquire 
a more accurate estimate in these solutions. 


Solution to Question 1 

In Figure [3.4] we only focus on achieving HT. Assign a variable to each of the three 
nodes, say x, y, z, that express the expected number of tosses in reaching our present 
destination, namely, HT. Naturally, z = 0 and was only added for the sake of unifor- 
mity. The three are combined into two equations: 


ee ee: 
Y= TYT a7 
i oi 

=i . 

Me gt ay 


The inclusion of 1 in the equations is due to the fact that they describe the number 
of coin tosses, and each tells us how these expectations are connected after one toss. 
The first equation yields y = 2 so that, from the second equation, x = 4. 


Figure 3.4: Getting HT. 


Solution to Question 2 


As in question 1, we introduce the variables x, y, z, with z = 0, again: 


1 1 
y=1 a z 
i 1 1 
t= z7 TaY 
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1 2 
The second equation reduces to 7 = y+ zI so that y = 3° Then the first equation 


2 1 
becomes 37 =1+ z” from which x = 6. 


Solution to Question 3 
Each of the four possible outcomes of two coin tosses is equiprobable and comes with the 


1 
probability of ie This is the only opportunity for TT to come before HT (Figure]3.5). 
It follows that HT is three times more likely than TT to show up first. In other words, 


the probability of HT showing first is - 


OZ ORO 
Figure 3.5: Getting TT. 


Solution to Question 4 
Both HT and HH start with H but then split with equal probabilities of a It follows 
that the probability of HT preceding HH and the complementary one of H H preceding 
HT are both 3 

Zhuo Xi suggested that question while pointing out that HH, like TT, has 6 tosses 
as the expected time to first appearance. This shows that the probability of : from 


question 3 is unrelated to the ratio 6:8 of the “time to first appearance” computed in 
questions 1 and 2. 
We may summarize similar results for all pairs of outputs of two tosses: 


HH | HT | TH | TT 
1/1]1 
HH = || 4 | = 
2 | 4) 9 
gr £ r3 
2 2 |4 
rH 32 | 1 4 
4 | 2 2 
rr i | if i 
21/4] 2 


Chapter 4 


Basic Probability 


The very name calculus of probabilities is a paradox. Probability opposed 
to certainty is what we do not know, and how can we calculate what we 
do not know? 


Henri Poincare, 1913, Science and Hypothesis 


Riddles 


4.1 Admittance to a Tennis Club 
pp. 2-10], [51] pp. 4-6] 


As a condition for the acceptance to a tennis club, a novice player N is set to meet two 
members of the club, G (good) and T (top), in three games. In order to be accepted, 
N must win against both G and T in two successive games. N must choose one of the 
two schedules: playing G, T, G or T, G, T. Which one should he choose? 


4.2 Black Boxes in a Chain 


N black boxes are chained so that the output of one is the input of the next. The 
inputs (and the outputs) are binary, 0 or 1. 


DEDE De 


Each box passes its input on with probability p and changes it with probability 1 — p. 
What is the probability of having the last output the same as the original input? 


4.3 A Question of Checkmate 


Problem 2.5] 


The white king is randomly placed in the third row, the black king in the first row 
and the white queen either in the second row or elsewhere in the first row. What is 
the probability that the pieces are in a checkmate position? 


4.4 Concerning Even Number of Heads 


Flipping a coin, heads comes up with the probability of p0 < p< 1. 


e What is the probability of having an even number of heads in n flips? 


e What is the expected number of flips before getting an even number of heads the 
first time? 
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4.5 Getting Ahead by Two Points 
Proof 3] 


A and B are playing ping-pong with the agreement that the winner of a game will get 
1 point and the loser 0 points; the match ends as soon as one of the players is ahead 
by 2 points or the number of games reaches 6. Suppose that the probabilities of A and 


2 1 
B winning a game are 3 and 7 respectively, and each game is independent. Find the 


expectation E (£) for the match ending with € games. 


4.6 Recollecting Forgotten Digit 
103 


You forget a digit of a phone number and make random guesses. What is the probability 
that you’ll guess correctly in at most two attempts? Three attempts? 


4.7 Two Balls of the Same Color 
Problem 65284] 


In an urn there are black and white balls, 121 balls in all. The probability of randomly 
drawing two balls of the same color equals 0.5. 
How many balls of each color are there? 


4.8 To Bet or Not to Bet 


p. 317] 


I invite you to play the following card game: 

Shuffle an ordinary pack of cards and deal them in pairs, face up. If both cards 
of a pair are black, you get them; if both are red, I get them. Otherwise, the cards 
are ignored. 

You pay $1 for the privilege of playing the game. When the deck is exhausted, the 
game is over, and you pay nothing if you have no more cards than I have. On the other 
hand, I will pay you $3 for every card that you have more than I have. 

Would you care to play with me? 


4.9 Lights on a Christmas Tree 
Problem 65277] 


For the holidays, a Christmas tree was adorned with 100 lights. The lights were num- 
bered from 1 through 100. A controller was programmed to apply a switching algo- 
rithm: on every step the light that was on is turned off and vice versa. The algorithm 
worked in a sequence. First it turned all lights on. It then applied to the even numbers, 
then to the multiples of 3, next to the multiples of 4, and so on. 

What was the probability that a randomly selected light would be on after 100 
steps of the algorithm? 


4.10 Drawing Numbers and Summing Them Up 
Problem 24] 


A bag contains four pieces of paper, each labeled with one of the digits, 1, 2, 3, 4, with 
no repeats. Three of these pieces are drawn, one at a time without replacement, to 
construct a three-digit number. What is the probability that the three-digit number is 
a multiple of 3? 
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4.11 Numbers in a Square 


Each of the 49 entries in a square 7 x 7 table is filled with an integer, 1,2,...,7, so 
that each column contains all of the seven integers and the table is symmetric with 
respect to its diagonal D going from the upper left corner to the lower right corner. 
What is the probability that the diagonal D has all the integers 1,2,...,7? 


4.12 Converting Temperature from °C to °F 


Problem 108| 


A two-digit street temperature display flashes back and forth between temperature 
in degrees Fahrenheit and degrees Centigrade (Celsius). Suppose that over the course 
of a week in the summer, the temperature was uniformly distributed between 15°C 
and 25°C. 

What is the probability that, at any given time, the rounded value in degrees F 
of the converted temperature (from degrees C) is not the same as the value obtained 
by first rounding the temperature in degrees C, then converting to degrees F, then 
rounding once more. 


4.13 Probability of No Distinct Positive Roots 
Christopher D. Long, Riddle on page 


An ordered pair (b,c) of real numbers, each with an absolute value less than or equal 
to five, is chosen at random with each such ordered pair having an equal likelihood of 
being chosen. 

What is the probability that the equation z? + bz + c = 0 will not have distinct 
positive roots? 


4.14 Playing with Integers and Limits 


Problem 68] 


A game starts with a random integer. Two players take turns modifying the integer 
present on their turn. A legal move depends on whether the integer is odd or even. If it 
is even, the player can either subtract 1 or divide by 2. If the number is odd, the player 
may subtract 1 or subtract 1 and subsequently divide by 2. The game ends when the 
number reaches 0, and the player who made the last move wins. 

Assuming both players use the best available strategies, if the starting number 
ranges from 1 to some n, with n > co, what is the probability of a win for the second 
player? 


4.15 Given Probability, Find the Sample Space 
Chapter: From Sex to Quadratic Forms] 


A school teacher is in charge of a group of students. She wants to select two of them 
at random and observes that it is exactly an even chance (50%) that they are of the 
same sex. What can be said about the number of children of each sex in the group? 


4.16 Gladiator Games 
pp. 76-77] 


There are two teams of gladiators, each of which has been assigned a strength attribute 
(a positive number). When two gladiators from different teams meet, their chances 
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to win the duel are proportional to their strengths. A fellow who loses a duel, dies 
and takes no part in further combats. The winner—who returns to the bench and is 
immediately ready for another fight—has his strength augmented by that of the loser. 
The last team standing wins. 

Let there be two teams with individual strengths (in, say, non-decreasing order) 
{a1,@2,..-,@m} and {b1,b2,...,bn}. Find an optimal strategy for a team, i.e., the 
order of sending team members to duel opponents from the other team, to maximize 
its chances of winning the competition. 


4.17 In Praise of Odds 
Chapter 12] 


Teams A and B play a series of games, each with three possible outcomes: A wins, B 

wins or they tie. The probability that A wins is p, the probability that B wins is q and 

the probability that they tie is r = 1 — p — q. The series ends when one team has won 

two more games than the other, that team being declared the winner of the series. 
What is the probability that team A wins the series? 


4.18 Probability in Scoring 
Problem 65264] 


A boy plays a game of Scoring with a computer without yet learning to play optimally, 
but the computer makes no mistakes. They start with a pile of 14 stones and remove 
1, 2, 3, 4 or 5 stones at a time. The player to remove the last stone wins. The boy 
goes first. 

What is the probability of his winning the game? 


4.19 Probability of 2” Beginning with Digit 1 
103 


What is the probability that a power of 2 begins with the digit 1? 
For an event A that could be defined for all n € N, A(N) = AN [1,2,..., N]. The 
|AW)| 


probability P(A) is defined as Nim, — 


4.20 Probability of First Digits in a Sequence of Powers 
[90]. The riddle is an extension of the previous one, Riddle 


Let N be an r-digit number. What is the probability that the first r digits of 2” 
represent N? 


4.21 Probability of Four Random Integers Having a Common Factor 
103 


What is the probability of four random integers having a common factor? 


4.22 Probability of a Cube Ending with 11 
What is the probability that the cube of a positive integer chosen at random ends with 
the digits 11. 
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4.23 Odds and Chances in Horse Race Betting 
Problem 313] 


b 
If the odds against a horse winning a race are a : b, then we may call Tl the apparent 
a 


chance of that horse winning the race. 

Prove that, if the sum of all apparent chances of winning of all the horses in the 
race is less than 1, one can arrange bets so as to make sure of winning the same sum 
of money whatever the outcome of the race. 


4.24 Acting As a Team 
Chapter 6], pp. 120, 123-126] 


Three hats (each of which is either red or blue) are placed on the heads of three people 
facing each other. Each has to guess the color of his hat, if possible. There are some 
rules to the task: 


1. The players act as a team. The team wins or loses, not the individuals. 


2. After the hats have been placed on the heads there must be no communication 
between team members. 


3. Each player may guess only once and all those who choose to make a guess do 
so simultaneously on a signal. 


4. Each player is allowed to pass rather than make a guess. 


5. The team wins only if at least one player makes a correct guess and no player 
guesses incorrectly. 


Devise a strategy (that can be discussed before the game starts) and compute the 
resulting probability of success. Try to make the latter as large as you can imagine. 


4.25 Sum of Two Outcomes of Tossing Three Dice 
AFSME 1996 47-16 


A fair standard six-sided die is tossed three times. Given that the sum of the first two 
tosses equals the third, what is the probability that at least one 2 is tossed? 


4.26 Chess Players Truel 


Players A, B and C play a series of chess games (a truel). Assume that A is the 
strongest player and C is the weakest one. Assume that there is no tie for each game. 
The winner of each game will play with the third player. The player who first gets 
two wins is the winner of the series. Player B determines who will play the first game. 
Find the best choice for B. In general, if probability of A to win against B is p > .5, 
probability of B to win against C is q > .5, probability of A to win against C is r and 
r > p, evaluate the chances of B. 
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4.27 Two Loaded Dice 
Problem 8.7] 


Show that if two dice are loaded with the same probability distribution, the probability 


1 
of doubles is always at least A 


4.28 Crossing Bridge in Crowds 


It takes 5 minutes to cross a certain bridge and 1000 people cross it in a day of 12 
hours, all times of day being equally likely. 
Find the probability that there will be nobody on the bridge at noon. 
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Solutions 


Admittance to a Tennis Club 


Let g and t denote the probabilities of N beating G and T, respectively. The possibil- 
ities for the sequence TGT can be summarized in the following table: 


| T | G | T | Probability | 
w\/|wiw tgt 
W/W |L | tg(1-t) 
LIW W| (1-dgt 


Pertinent to the previous discussion is the observation that the first two rows nat- 
urally combine into one: the probability of the first two wins is 


P(WW) = tgt + tg(1 — t) = tg, 


which is simply the probability of beating both T and G (in the first two games in 
particular). 

Since winning the first two games and losing the first game but winning the second 
and the third are mutually exclusive events, the sum rule applies. Gaining acceptance 
playing the TGT sequence has the total probability of 


Prer = tg + tg(1 — t) = tg(2 — t). 
Similarly, the probability of acceptance for the GTG schedule is based on the 
following table: 
| G | T | G | Probability | 
WIW gt 
WW) (1-g)tg 


The probability in this case is found to be 
Pera = gt + gt(1 — g) = gt(2 — g). 


This is a curiosity. Do you see why? 
Assuming that the top member T is a better player than just the good one G, 
t < g. But then gt(2 — g) < tg(2 — t). In other words, 


Pare < Prar. 


The novice N has a better chance of being admitted to the club by playing the 
apparently more difficult sequence TGT than the easier one GTG. Perhaps there is a 
moral to the story/riddle: the more difficult tasks offer greater rewards. 


Black Boxes in a Chain 


Solution 1 
The original input and the final output will coincide if the number of input/output 
reversals is even; they will differ if the number of reversals is odd. 

For each box, imagine a coin toss that comes up heads with a probability p decides 
the behavior of the box. We know that after N transmissions the probability of having 


1 1 
an even number of heads is P(N) = 5 + 5 — 2p)”. 
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If N is even, then the probability of having an even number of tails, i.e., an even 

1 1 
number input/output reversals, is Q(N) = P(N) = 5 + 5 — 2p). But, if N is 
odd then getting an even number of tails is the same as getting an odd number of 


heads, i.e., 


1 
2 


Q(N) =1- P(N) =1 (5+50 apy") = 5 5-20)", 


The two expressions for Q(N) can be combined into a single formula: 


_4\N 
CN" a apy = t Sop 1%. 


Solution 2 


With the same reasoning as solution 1, if q = 1 — p then the probability of getting an 
even number of tails is 


1 1 
N) == ++- 2)" 
QN) =5 +30- 20) 
1 1 
=5 + 501-202)” 
1 1 
= =+ Ope 1)”. 
zt3r-1) 
Solution 3 
Amit Itagi 


Any number of flips will do. Let x = p and y = 1 — p. Then the required probability is 


N 1 i 
» = 5 [yt+a)"+y-2)"] = 5 [1+ (2p-1)*]. 


k is even 


A Question of Checkmate 


This is clearly a counting problem in which the probability of an event is determined in 
the classical manner, as the ratio of the number of successful outcomes to the number 
of all possible outcomes. The latter is immediate: The white king may be in one of 
eight positions in the 3rd row. Similarly, the black king may be in one of eight positions 
in the first row. The white queen is either in one of the eight positions in the 2nd row 
or in one of the remaining seven in the first. By the product rule, the total number of 
possibilities is 8 x 8 x (8 + 7) = 960. 

The task is to count the number of the successful outcomes, i.e., the number of 
positions in which the black king is checkmated. There are several cases: two for the 
black king in a corner, and three for the black king elsewhere. 


The Black King in a Corner 


For each corner, the white pieces may be in 16 positions, with the total of 2 x 16 = 32 
possible configurations, Figure [4-1] 
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Figure 4.1: The black king in a corner. 


For each corner, there is one extra position (+2), Figure [4.2] 


Figure 4.2: The black king in a corner, extra position. 


The Black King Not in a Corner 
There are five positions when the kings are in opposition (+6 x 5 = 30), Figure [4.3] 


Figure 4.3: The kings are in opposition. 


There are three positions with the white queen in the second row (+6 x 3 = 18), 


Figure 


Figure 4.4: The white queen in the second row. 


There are five positions where the kings are one vertical position apart and the black 
king is blocked by the queen from moving into the second row. This could happen in 
either direction (+2 x 5 = 10), Figure [4.5] 
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Gg w 


Figure 4.5: The kings are one vertical position apart. 


All in all, the number of checkmate positions is 32 + 2 + 30 + 18 + 10 = 92. The 


sa : . . 92 23 
probability of this happening is 960 ~ 240° 


Concerning Even Number of Heads 


Let P(n) be the probability of having an even number of heads after n coin tosses. 
Obvioiusly, P(0) = 1, as 0 is an even number. Also, 1 — P(n) is the probability of 
having an odd number of heads after n tosses. 

If the number of heads was even after n tosses, it will remain even next time with 
probability 1 — p; if the number of heads was odd it will become even with probabiliy 
p. Thus, we can establish that P(n) satisfies the following recurrence relation: 


P(n +1) =p(1— P(n)) + (1—p)P(n) = p + (1— 2p)P(n), 


with the initial condition P(0) = 1. To make sure that we are on the right track, 
P(1) = p + (1 — 2p)P(0) = 1 — p, the probability of having a tails and, in this case, 
keeping the number of heads even. To solve the recurrence, assume P(n) = A+ Ba”, 
for some constants A, B and a: 


A+ Ba"! = p+ (1—2p)(A+ Ba”). 
Handling the expressions by A and B separately gives 


A = p+(1—2p)A 
Ba (1 — 2p)Ba”. 


1 
We find A = 5 independent of p, and B arbitrary, provided a = 1 — 2p. Thus, the 


solution to the recurrence relation appears to be 


P(n)= > + B(1 — 2p)”. 


1 
To detemine B, we will use the initial condition: 1 = P(0) = 5 + B, which gives 


1 1 1 
B= 5 with the final form of the solution P(n) = zT z0 — 2p)”. 


1 1 
For a fair coin, i.e., if p = 7 P(n) = 7 independent of n. 


To answer the second question, for the first appearance of an even number of heads 
in n tosses, 


e the last one needs to be heads, 
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e the total number of heads needs to be 2 (in the limit the rare event of no heads 
will play no role), 


e the first heads could be in any of the first n — 1 tosses. 


Thus, we get the expectation of the first occurrence of an even number of heads: 


Co 


a 2 
E(p) = X n(n — 10 — pp? = = 
n=2 P 
(Denote, say, f(x) = 5 n(n—1)a"~?. Integrate twice from 0 to z to get 5 CS 
n=2 n=2 
x 2 


. Differentiate twice to find f(x) = z: Then substitute x = 1 — p and 


C-z) 


1-2 


multiply by p? that was left over.) 


Thin) Sen eaa ee 
4 2 4} 3 


Getting Ahead by Two Points 


Solution 1 


It is easy to see that € can only be 2, 4 or 6. So, we divide six games into three rounds 
of two consecutive games each. If one of the players wins two games in the first round, 
the match ends with the probability of 


Otherwise, the players tie with each other, earning 1 point each, and the match 
4 
enters the second round; the probability of this happening is 9° 


We have similar discussions for the second and the third rounds. Thus, 


Pe=2)=2, 
4 5 20 
PERAS 9 Sp 
2 
pe=6)=(5) = 
Finally, 


5 2 
BE =2. 9 +4. +6. = 
Solution 2 
Amit Itagi 


Let A; denote the event that A wins the k*® game, and A; denote that B wins the 


kt? game. Since A; and A; are incompatible and are independent of the other events, 
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we have 
Pes PAA PA 


P(€ = 4) = P(A, A2A3Aq) + P(A1 A243414) + P(A) 424344) + P(A] A243414) 
a /1 R 
3 3 


81’ 
P(€ = 6) = P(A, A2A3A4) + P(A, A2A3A4) + P(A) A424344) + P(A] A243414) 


_ 16 
81" 
It follows that, 
Re=2.° ido | g. 16 — 266 
9 81 81 81 


Following is a graphical representation of the solution. 


Scores in parentheses; Branch probabilities in red; Node cumulative 
probabilities in blue and green „a for terminal nodes) 


(1,0)2/3 onu 
2/3 1/3 2/3 1/3 
(2.0) (1.1) (0.2 
4/9 — 4/9 ceo 1/9 
2,1)8/27 2)4/27 
2/3 Mahe 2/3 arr 
3,1 ps 2,2 ; 1,3 
ven 2/3 a 1/3 ae 
(3,2) 32/243 (2, 302a 
2/3 1/3 2/3 1/3 
(4,2 (3,3) (2,4) 
64/729 64/729 16/729 


Elé] = P(1) +2- P(2) +3- P(3) +4- P(4) +5- P(5) +6- P(6) 
=0+ 2. (4/9 + 1/9) +3- 0+4- (16/81 + 4/81) + 
5- 0 +6- (64/729 + 64/729 + 16/729) 
= 266/81 ~ 3.284 
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Recollecting Forgotten Digit 


Solution 1 


The riddle is a variant of drawing balls of two colors (Riddle [3.1] on page [15}, where, 
1 
say w = 1, b = 0 and w +b = 10. P(W1) = P(W2) = P(W3) = 10 An important 


observation is that successive guesses are independent events, Appendix[A]on page 
The (first) event we are looking for is W = W, U W2 which comes with the proba- 


1 1 1 

bility P ns ne 

my Way 0 
For three attempts the probability is 1 + 1 | i 
oP Eterna nares eg o ge 0 


Solution 2 
1 
The probability of a correct guess is p = To" Two attempts will suffice if we guess 


correctly the first time or if we fail and guess correctly on a second attempt. This gives 
the probability as 


ve 1\ 1 2 
10 10/ 9 10 


For three guesses, 


1 1 1 1 1 1 

P(Success) = T + (1 5) 3 + (1 5) (1 5) B 
1 
8 


1 9 1 9 8 
10 10 9° 10 9 
3 
10 


Solution 3 
Imagine that we try two times, regardless of whether the first guess was correct. Then 
there are 10 x 9 ordered pairs of numbers that could be “recollected” with equal chances. 
Of these, nine include the right guess on the first attempt and nine on the second. In 
all, there are 9+ 9 = 18 pairs that realize a successful recollection. The probability of 
. 18 
this event is — = =. 
90 5 
For the second part, we make three guesses regardless of whether either the first or 


the second was successful. There are 10 x 9 x 8 = 720 ordered triples to choose from. 
If the first guess was a hit, there are 9 x 8 = 72 variants for the second and the third 
guesses. With the second correct guess, there are nine first guess failures times eight 
distinct but irrelevant third guesses, 9 x 8 = 72 in all. The right third guess comes 
after 9 x 8 = 72 failures in the first and second guesses, giving the probability of the 
success in three guesses as 


72+72+72 3 
720 ~ 10° 
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Two Balls of the Same Color 


Solution 1 
Let there be b black and w white balls. We have b+ w = 121 and 


w(w— 1) + b(b-1) 
121-120 


It follows that b and w satisfy another equation: 
121-60 = w? +b? — (w +b) = w? +b? — 121, 


so that w? + b? = 121-61. Now, (w + b)? = 121? from which wb = 121 - 30. Thus, w 
and b are the roots of the quadratic equation: 


x” — 121x + 121-30. 
Applying the quadratic formula, 


1214+ /1212 — 4-121 - 30 
21,2 = 
2 
_ 121 +/121(121 — 120) 12111 
= ; = A 


Then, e.g., 2; = 55 and z2 = 66 which could be the numbers of black and white 
balls or vice versa. 
55 and 66 are successive triangular numbers and sum up to 121, a square. 


Solution 2 
Using the same framework from solution 1, 
1 w w—-l — b b-1 
2 wtb wtb-1 wtb wtb-1 
w(w —1)+b(b—-1) 
(w+b)-(w+b—1)’ 


and multiplying out, we get 

(w + b)(w +b-— 1) = 2w(w — 1) + 2b(b — 1) = 2w? + 2b? — 2- (w + b), 
or 

(w — b)? = w +b = 121, 


meaning that, assuming w > b, w — b = 11. Combining this with w + b = 121, we get 
the same result as above. 
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Solution 3 
N.N. Taleb 


We have four possible outcomes: 


W W 
W B 
S m 
B W 
B B 
We have 
W W-i1 B B-1 1 
P BB)= . . = 
EN ) 121 120 T 121 120 2 
W B B W 1 
P(W B||BW) = 


121 120 121 120 2° 


Solving, we get B? + W? = 7260 +b + W and BW = 3630, from which one root is 
66 and the other is 55. 


To Bet or Not to Bet 


As a matter of fact, we always end up with the same number of cards. Thus, you 
always lose $1. 


Lights on a Christmas Tree 


Solution 
The state of a light depends on the number of factors of the corresponding number. 
All integers but squares have an even number of distinct factors. For every factor, a 
light is switched on/off. After an even number of such operations the light is in its 
original state—off. 

The last light to which the algorithm applies is #100. At this point, the only lights 
that are on correspond to the squares between 1 and 100, inclusive. There are 10 of 

1 


them. Thus, the probability of a random light being on is 100 10° 


Illustration 
Marcos Carreira 


A step-by-step of the lights’ on/off state is well represented by the following diagram. 
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Drawing Numbers and Summing Them Up 


Notes 

We will give two solutions to the riddle. The starting point for both is the criteria 
of divisibility by 3. A number is divisible by 3 if and only if the sum of its digits is 
divisible by 3. The digits in the box add up to 10. The three digits drawn may add up 
to2+34+4=9,14+34+4=8,14+24+4=7o0rl+2+43=6. Two of these (6 and 
9) are divisible by 3. The second observation is that the sum of the digits of a number 
does not depend on the order in which the digits are added because addition of the 
integers is commutative. This means that the order in which the digits are drawn from 
the box has no effect on the divisibility by 3 of the final number. 


Solution 1 


We are looking into the probability of drawing the numbers composed either of the 
digits 1, 2, 3 or of the digits 2, 3, 4. Let us first consider the probability of drawing 


3 
the digits 1, 2, 3. The probability of drawing one of these on the first draw is 7 The 
2 
probability of drawing the second one out of the remaining three is 3 and that of getting 


1 
the third on the last drawing is 5 Since the order of the digits’ appearance from the 


box is not important, the three drawings are independent and the probability of getting 


3 2 1 4 
the three particular digits 1, 2, 3 is then the product of the three: p = 13a a 
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1 
So, the probability of getting the number composed of the digits 1, 2, 3 is T The 
same reasoning applies to the digits 2, 3, 4 so that the probability of getting the latter 


1 
three is also T Finally, the two events—one of getting the digits 1, 2, 3 and the other 


of getting the digits 2, 3, 4—do not occur at the same time. You get either one or 
the other (the two are mutually exclusive) meaning that the combined event has the 
1 


1 1 
probability which is the sum of the two probabilities: 1 + IF 


Solution 2 

Let us focus on the number that remains in the box after three drawings. It may be 
any one of the digits 1, 2, 3, 4. When this is 1 or 4 that is left in the box the three 
drawn digits add up to a number divisible by 3. When the left-over digit is either 2 or 
3, the number drawn is not divisible by 3. Thus, in half the cases (two out of four) the 
number drawn is divisible by 3. The two possibilities are equiprobable and mutually 


1 
exclusive, implying that each of the possibilities has the probability of 5° 


Numbers in a Square 


The total number of “1” entries in the table is odd. However, due to the symmetry con- 
dition, the number of 1s off the diagonal D is even. It follows that there is a 1 on the 
diagonal. The same holds for other entries and other numbers. Therefore, all seven inte- 
gers 1 through 7 appear on the diagonal, implying that the probability in question is 1. 


Converting Temperature from °C to °F 
Amit Itagi 


The conversion from °C to °F is given by 
9 


The 32 is just an integral constant, the rounding steps do not affect it, and it can be 
dropped in the context of this riddle. Also, the riddle is periodic in C with a period of 
5. So without loss of generality, we can map all the C values modulo 5 to a range [0, 5]. 
In fact, the range [2.5, 5.0] can also be mapped to [0, 2.5] by using a map f(x) = 5.0— x 
without changing the considerations of the riddle. This can be seen from the following 
argument. Consider two intervals p € [0,2.5) and q € (2.5,5] for C (modulo 5) (the 
second and third columns in the following table). The notation R(-) is for rounding. 
Whenever p is different from R(p) or 9p/5 is different from R(9p/5), q is symmetrically 
different from R(q) and 9q/5 from R(9q/5). 


C 5K +p 5K +q 
(=5K+5-p) 
R(C) |5K+R(p)| 5K +5—R(p) 
90/5 | 9K +9p/5 9K +9q/5 


(= 9K +9 -— 9p/5) 
R(9C/5) | R(9p/5) 9 — R(9p/5) 
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9 
Thus, without loss of generality, let us assume 0 < C < 2.5 and F = z Hence, 


with or without any of the rounding steps, the converted F will lie in [0, 4.5]. The 
notation R(-) is used for rounding. The different intervals are tabulated in the following: 


C R(C)| 9C/5 |9 R(C)/5 9C/5 Intervals with 
Differences in Rounded Values 
0.0 — 0.5 0 0.0 — 0.9 0.0 0.5 — 0.9 
0.5 — 1.0 1 0.9— 1.8 1.8 0.9 — 1.5 
1.0 — 1.5 1 1.8 — 2.7 1.8 2.5 — 2.7 
1.5 — 2.0 2 2.7 — 3.6 3.6 2.7— 3.5 
2.0 — 2.5 2 3.6 — 4.5 3.6 N/A 


Thus, the required probability is 


(0.9 — 0.5) + (1.5 — 0.9) + (2.7 — 2.5) + (3.5-2.7) 2 
4.5 45 9 


Illustration 
Attila Kun 


The following graph lets one visually compare the two kinds of conversion: 


yi 


The graph was produced by this small piece of code: 


x <- seq(15, 25, 0.001) 


y1 <- round(1.8*x + 32) 
y2 <- round(32 + 1.8*(round(x))) 


plot(x, y1, type="1", col="red") 
lines(x, y2, col="green") 


1 
2 
3 
4 
5 
6 
7 
8 sum((y1 - y2)^2 > 0) / length(x) # p ~ 0.447555 
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Notes 
Keith Dawid 


The discrepancies by range are listed in the following table: 


15° 16" — 1/9 
16° —17° =. 2/9 
17° —18° — 3/9 
18° —19 — 4/9 
19° — 20° — 5/9 
20° — 21° — 1/9 
21° = 99 — 2/9 
22° — 23° — 3/9 


Probability of No Distinct Positive Roots 


As in the original Riddle on page [|261| the basis for the solution is naturally the 
—b + vV b2 — 4c 
5 : 
Now, it may be easier to determine when the equation does have distinct positive 
roots. This happens when c > 0, b < 0 and b? — 4c > 0. 
With b on the horizontal axis and c on the vertical one, we have to compute the 


b2 
area under the parabola c = T inside the square [—5,5] x [—5,5], relative to the 


quadratic formula: 21,2 = 


square itself. 


The parabola crosses c = 5 for b = — v20. The area in question is 


s=56- v+ | _ =56- V3 +5=56- V), 
— v20 


so that the probability of having distinct positive roots is 


56- v2) _ 3- v5 


100 10 
with the complementary probability 
3=- v5 T7+v5 
1 5 zx 0.92 
Ti Ti 0.923607 


This is a little more than in the original riddle (= = 0.917355). 
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Playing with Integers and Limits 


Try playing the game with small starting numbers. 

If the starting number is 1, the player whose turn it is to make a move (i.e., the 
first player) wins. 

If the starting number is 2, regardless of the move, the first player leaves 1, meaning 
that the second player wins. 

If the starting number is 3, the first player may leave either 1 or 2 and become the 
second player on the next move. A good player would leave 2 and win the game. 

Here is a proposition: 


If the starting integer N has an even number of factors of 2 (in particular 
when N is odd), the first player should win. If the starting integer N has 
an odd number of factors of 2, the second player should win. 


Solution 1 


We will prove the above statement by strong mathematical induction. 
Assume it is true for any N < k, for some k > 1. Consider N = k + 1. There are 
three cases: 


Case 1 
k +1 is odd. 


k k 
The first player may move to either k or x One of k or 3 has an odd number of 


factors of 2. According to the proposition, this is the number the first player should 
leave on the first move, after which he becomes the second player and wins. 


Case 2 


k +1 is even and has an odd number of factors of 2. 


1 
k is odd and, if left odd, the other player wins. If the first player leaves = ; 


which has an even number of factors 2, the other player still wins. 


Case 3 
k + 1 is even and has an even number of factors of 2. 


k+1 
The first player loses if he leaves the odd k. If he leaves E, then, according to 


case 2, he will win after becoming the second player. 

Thus, in cases 1 and 3, the first player should win; in case 2, the second player wins. 
In other words, the second player wins if the starting number has an odd number of 
factors of 2. How many such numbers are there? Let us count: 


e Every number in the form 2n where n is odd: these are numbers 2,6,10,.... 
Every fourth integer is in this form. So, the probability of falling into that group 

1 

is —. 

4 
e Every number in the form 8n where n is odd: these are numbers 8, 24, 40,.... 
These numbers differ by 16 = 47. So every sixteenth integer falls into that group. 
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We may continue in this manner, but it becomes clear that the proportion of 
integers having an odd number of factors 2 is the sum of the series 


This is (more or less) the probability of a win for the second player. 


Solution 2 
Amit Itagi 


We claim that the starting number n for the first player uniquely determines the 
outcome of the game. If this claim is true, then we can define a function f: 


, if player 1 wins 
fin) = ne 
0, if player 2 wins. 


Clearly, f(1) = 1 and f(2) = 0. f(2) = 0 because if the first player starts at 2, the 
only choice available is 1 and then the second player wins by going from 1 to 0. We 
validate our claim by showing that the function gets recursively defined for all n in the 
following way: if the possible two outcomes of the current move lead to a place where 
the person making the next move is guaranteed to win, then the person making the 
current move is guaranteed to lose. Hence, 


0, if f(2k—1) =1and f(k) =1 


1, otherwise; 


f(2k) = 


and 


0, if f(2k)=1and f(k) =1 


pees fi 


, otherwise. 


The definition of f(2k) leads to the following implication: 
f(k) =0=> f(2k) =1. 


Thus, the first condition in the definition of f(2k + 1) can never be satisfied and 
f(2k + 1) = 1 unconditionally. This in turn proves the converse of the preceding 
implication: 


f(k) =0 f(2k) =1. 


Thus, from the definition of f(2k), f(4k + 2) = 0 because f(4k + 1) = 1 and 
f(2k+1) =1. Thus, the only numbers left to investigate are the numbers of the form 
4k. Because multiplying by 2 flips the function, we end up with the following tree like 
structure: 


54 Cut the Knot: Probability Riddles 


we Pe 


4k T 4k+1 (1) 4k+2 (0) 4k+3 (1) 


16k+8 (0) 16k 


32k+16 (1) 32k... 


The nodes show the form of the number and the number in the parentheses shows 
the value of f(n). Thus, the probability that the second player wins is the probability 
of getting f(n) = 0. The required probability is 


Given Probability, Find the Sample Space 


Solution 1 

Simon Norton 

(6+ 9)(0+ 9-1) 
2 

bg pairs are of different sexes. To ensure the 50% chance of the same sex selection, the 

former number needs to be twice the latter: 


(b+ g9)(b+ 9-1) 
2 


pairs of which 


From a group of b boys and g girls a teacher can form 


= 2bg 


which reduces to 
(b- g9) =b+g. 
Letting b — g = n leads to a system, 


b-g=n 
b+g=n’, 
1 =a 
from which b = a, g= a Cases where n = 0 (no kids at all) or n = +1 


(one sex is missing) can be disregarded. Other than those, n could be any integer. 
When n is negative, g > b; when it is positive, b > g. 
In any event, b and g need to be two consecutive triangular numbers. 
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Solution 2 
Amit Itagi 


Suppose there are b boys and g girls. The number of ways of choosing two boys is 
C(b, 2), the number of ways of choosing two girls is C(g, 2) and the number of ways of 
choosing a boy and a girl is bg. Thus, the condition of the riddle implies 


C(b,2) + C(g,2) = bg 
b(b— 1) + g9(g- 1) 2bg 
1 1 
b = It3t3 8g +1 
Note, b = g results in b = g = 0. Thus, b ¥ g. Noting that the riddle is symmetric in 
b and g, let us find solutions with b > g and get the other set of solutions by swapping 
b and g from the first set. For b > g, we have to drop the negative square root. 


Noting that any odd square is of the form (8k + 1), let 8g +1 = (2n +1)? for n > 0. 
Thus, the first set of solutions is 


(Q2n+1)?-1 n(n+1) 


8 2 
panre iha ee Din 2): 
8 2 


and the alternate set of solutions is 
(n+ 1)(n+ 2) 
2 
n(n + 1) 
= 3 , 


where n is any positive integer. 


b= 


Solution 3 
N.N. Taleb 


The binomial distribution with n trials and success probability p is 


(1 — p)” *p? C) 0<zr<n 
x 


0 otherwise. 


P(X =2) = f(z) = 


1 
Here we have f(2) = z Let n = ng + ng be the total number of pupils, with ng 


the initial number of boys, etc. Let p = CB as, We have the following tuples 
n na +nB 
in a draw of 2: 
pop ees 
n n-l1 
gp g S 
nn-1 
G p Pes 
nn-1 
G ng ng — 1 


n n-li 
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We have to solve for 


(na z= 1) NG 1 


(ng = 1) nB 
(ng + nge — 1) (ng + nq) 


(ng +n@—1) (ne 4+ ne) 2 
which reduces to 
4ngnG 


(ngp+ng@—1)(nptne) 


bl 


leading to two solutions, depending if ng > ng or the reverse: 


ng = = (+V8ng +14 2ng +1). 


1 = 
It follows that ng, np = z” + yn). To make all the quantities integer we need to 


assume that n is a square: n = k?, simplifying the above to ng, ng = gtk +1). 


Notes 
A. Bogomolny 


Rob Eastaway posted a modification on Twitter (@robeastaway): 


In a group of children, there are three boys. If I pick two children at random, 
there is a 50% chance both are boys. How many girls are in the group? 


n(n +1) 


This can be better generalized. Let Ta = be the n‘® triangular number. 


Then we can pose the following problem: 


In a group of children, there are Tn boys. If I pick two children at random, 
there is a 50% chance both are boys. How many girls are in the group? 


n(n —1) 


= i 
2 i 


Repeating the derivation above will lead to the answer g = 


3 
To answer Rob’s question, 3 = 2- 2 Tə. Therefore, in his case, the number of 


1-2 
girls is T} = En = 


This riddle (under the guise of drawing socks from a drawer) is also included in 


Problem 1]. 


Gladiator Games 


There is no optimal strategy. No strategy effects the chances of a team to win the 
n 


m 
competition. Let A = 5 a; and B = 5 bi. Then the chances of a team to win are 
i=1 i=1 


A 
proportional to its total strength, i.e., for the first team, it is ae B for the second, 


it is 


A+B 
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The proof is by induction on n+ m, assuming both n and m are greater than zero. 
The claim is certainly true for m = n = 1. Assume it is true for m+n = N, and 
now that one team still has m members, let the membership of the other grow to n+1. 


m n+1 
So now A=) a; and B= Se op 
i=l i=1 


For the first duel, the first team sends a gladiator of strength a, the second a 
gladiator of strength b. The first wins with the probability of A the second with 
a 


the probability of ; 
a+ 

Depending on who wins that duel, there will be two possible distributions 
of strength: 


1. a wins: {a1,...,@+0,...,@m} and {b1,...,0...,bn41}, 


2. b wins: {a1,...,@,...,@m} and {b1,...,b+a...,bn41}, 


where overline means the absence of the fighter. 
According to the induction assumption, in the first case, the chances of the first 


A- 
; in the second case they equal A making the overall 


a a 
+B’ +B’ 
probability as claimed: 


team to win equal 


a A+b i b A-a 
a+b A+B a+b ALB 
= aA+bA 
~ (a+b)(A+ B) 

A 
2a y 


a= 


In Praise of Odds 


The following solutions allow for a comparison of two approaches: a conventional 
approach with lengthy calculations of probabilities and the one based on estimat- 
ing the odds. As the reader may see, odds provide a significant shortcut for computing 
probabilities. 


Solution Setup 1 


The tied games are of no consequence in determining the winner of the series and can 
be ignored. We know, that the odds of A winning a game are p : q so that a step 
toward A winning a series has the probability of p’ = F For B the probability is 
Prd 
qd = ah Naturally, p' + q' = 1 and p’:q'=p:4q. 
p+q 


Solution Setup 2 


Let A denote the game won by A, B the game won by B and T the game that ended 
up tied. The latter is of no consequence in determining the winner of the series and 
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can be ignored. Then the series might look like this: 
TATTBATBITTBITA.... 


We shall split the sequence into “consequential” pieces by grouping a won game 
with the preceding ties: 


(TA)(TTB)(A)(TB)(TTTB)(TTA).... 


What matters at the end of the day is the last game in a group. The groups that 
end with A contribute to the advancement of A. These are: 


A,TA,TTA,TTTA,..., 
each exclusive of any other. The corresponding probabilities are p, rp, r7p,... that add 
up to 
1 _ P 
l-r pt+q 


p+rp+rp+ =p: 


which we shall denote as p’. q’ is defined similarly: q’ = oa 
Prd 


Solution 1 
OSWEGO Problems Group 


Either way, the riddle reduces to a win/lose sequence with the probabilities p’ and g. 

The last two games in a series are bound to be won by the same team and the 
games beforehand come in pairs of wins (or losses) for both teams. Thus, those pairs 
have the probability p'q'+q'p' = 2p'q'. A sequence of n such pairs comes with the prob- 
ability of (2p’q’)”. It follows that A wins a series of 2n + 2 games with the probability 
(2p'q')"(p')?. The total probability of A being declared a winner is 


P(A) = OP=OP 5 
k=0 k=0 
p’ (p +4)? p’ 


(p+q)? (pt+q)?-2pq P+ 


Solution 2 
The last two games in a series are bound to be won by the same team and the games 
beforehand come in pairs of wins (or losses) for both teams. The odds of A versus B 
winning two games in a row are p? : q’. It follows that A wins the series with the 
2 
a p 
probability of ———. 
pP +g? 


Probability in Scoring 


Solution 1 
A. Bogomolny 


The game of Scoring is a great tool for learning the modulo arithmetic. It is best 
analyzed starting from the end. The last move was the removal of 1, 2, 3, 4 or 5 stones. 
Thus, the previous player has left less than six stones. If he left six he would win. If 
he left more than six, the other player would be able to leave six stones and win. 
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So, whoever manages to leave six stones wins the game. Put another way, whoever 
takes the seventh stone wins. We can now apply the previous argument, but replace 
“first” with “seventh” and continue backward. The right strategy is then to leave a 
multiple of six after each move. If you do not, the other player will. 

The game starts with 14 stones; the boy should take two. If he does not, he loses 
right away. 

If he does, whichever move the computer makes next, the boy has one in six chance 
to make the only right move. The same applies to the next two moves, computer/boy. 

However they play, the right strategy is to leave a multiple of six. On the first move 


1 
the boy does that with the probability of 5 (he needs to remove two and leave 12 


stones). From there on there remain two pairs of moves, meaning that the boy yet has 
two opportunities to make a wrong move. As before, there is a one in five chance to 
111 1 
make a right move. Thus, the boy does that with the probability of 5°55 7 15" 
We may observe that the boy may reach the state of six stones with the probability 


1 
of 52 (hitting two on the first move and whatever it takes to get to six, depending on 


the computer’s move). After the computer’s move (probably random), the boy may be 
left with 1, 2, 3, 4 or 5 stones. Assuming that even the fellow who has yet to master 
the necessary skills to play Scoring is still clever enough to remove all the remaining 
stones in order to win the game, at this point, he wins it with certainty, making the 


1 
final probability of the win equal to 5: 


Solution 2 
Amit Itagi 


If player A gets the stone count to six, that player optimally forces a win. Player B 
can only leave 1-5, and in the final move, player A picks all the stones. 

With this observation, we note that if in the first move the boy gets the stone count 
to 13, 11, 10 or 9, the computer can force a win. In the cases of 9-11, the computer can 
bring the stone count down to six in a single move. For the case of 13, the computer 
can bring the stone count to 12 making 7-11 possible moves for the boy. The computer 
can then force a 6 irrespective of the stone count that the boy leaves. 

Thus, we focus on the case where the boy picks two stones in the first step and gets 


1 
the count down to 12. This first move has probability P) = 5° In this case, the boy 


always has an optimal path (not necessarily the chosen path) to victory. No matter 
what the computer chooses, the boy has only one in five ways of getting to six. In all 
other cases, the boy either has to leave a count greater than six with six accessible to 
the opponent in a single move or a count less than six such that all the stones can be 


picked up in one move. Thus, the probability of the boy winning in this case if P> = 
1 


Hence, the probability of the boy winning is P, - Po = 35° 


5 . 
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Probability of 2” Beginning with Digit 1 


Solution 1 
A. Bogomolny 


Observe that, for a given length (number of digits) there exists at most one power of 
2 that begins with digit 1. Moreover, for a given k > 1, there is always a power of 2 of 
length k. Exactly one of these begins with 1. Examples: 24 = 16, 2” = 128, 21° = 1024, 
214 = 16384. 

Indeed, for a given k, consider the largest power of 2 that has length k, say 2”. 
Then 2?*! is bound to have (k + 1) digits as it is less than 10-2? of (k + 1) digits and 
is larger than the largest among the powers of 2 with k digits. So, if there is a power 
of 2 with k digits, there is at least one with k + 1 digits. 

On the other hand, for a given k, the least power of 2 of lengths k is bound to 
start with 1. Otherwise, it could be divided by 2 to obtain a smaller power of 2 of the 
same length. 

It follows that for any k > 1 there exists exactly one power of 2 with k digits 
that begins with 1. Let q(K) denote the number of powers of 2 that begin with 1 
and are less than 10%, i.e., have no more than K digits. Then, what we found is 
that q(K) = K — 1. The total number N of powers of 2 below 10% comes from 
2N < 10% < 2+! so Nlog,,2 < K < (N + 1)logio 2, or K = Nlog,)2+ a, where 
0 <a < (N +1) logy) 2 — N logio 2 = logy) 2. Finally, 


K K-1)!l 2 
lim (K) = Jim (K = 1) logio 2 = log) 2- 
K—œ K-oo — Q 
Solution 2 
Amit Itagi 


If a positive number k is written in scientific notation m x 10” where m € (0,10) and 
n € Z*, then the mantissa is defined by f(k) = logy m = logio k— [logio k]. Noting that 
log;9 2 is irrational, from the equidistribution theorem [99], the set {f(2”) : n € Zt} 
is uniformly distributed on [0, 1). For the leading integer of k to be 1, its mantissa has 
to be less than logįọ 2. Thus, the required probability is 


log1o 2 
p= 2s ~ 0.301. 


Probability of First Digits in a Sequence of Powers 
Léo Sauvé 


A more general theorem holds where 2 can be replaced with any other integer, except 
for 10, and there is even a more general formulation. Given an r-digit number N, we 
may inquire about appearance of N as the first digits in the decimal representation of 
the powers of .2. From here, the riddle can be stated for an irrational a, not a rational 
power of 10: 


For a given r-digit number N, and an irrational a, as above, what is the 
probability that the decimal representation of a” begins with N. 


Define, for x > 0, u(x) as the mantissa of log x, that is, 


u(x) = loga — |log x| = log z(mod 1). 
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Let S = {a”}%,. Then (S) is dense in the open interval (0,1) which is proved 
with a relatively standard invocation of the pigeonhole principle [18]. See also another 
example: Riddle on page 

Due to the equidistribution theorem [99], the probability that the decimal rep- 
resentation of a” begins with N is the probability that u(a”) lies in the interval 
I = [w(N), uN + 1)) whose length is 


1 
p(N + 1) — w(N) = log(N + 1) — log N = log (1 + x) P 
Note that that probability is independent of a. 
Note also that, because the density of (S) is in (0,1), it follows that, for any N, 


there are infinitely many powers of a whose decimal representation starts with N. 


Probability of Four Random Integers Having a Common Factor 


Solution 1 


As in Riddle[3.5]on page|16 the probability of an integer to be even, i.e., to be divisible 


4 
1 1 
by 2, is z The probability of four numbers being even is (5) . Thus, the probability 


4 
1 
that at least one of the four numbers is odd is 1 — (5) ; 
Similarly, the probability that at least one of four random integers is not divisible 


4 4 

1 1 

by 3 is 1— G) . For any other prime, that is 1 — G) . 
Pp 


Since divisibility by one prime number is independent of divisibility by any other 
prime number, four random numbers have no common factor with the probability of 


4 
1 
l | i — G) | . It follows that the probability of four random numbers to have a 
p 


p prime 


common factor is 


-LO 


p prime 


Solution 2, Rational Estimate 


Co 


The sum 5 T is known as a zeta-function ¢(s), many of whose values are known 
k=1 
x 
specifically. In particular, ¢(4) = —. In the following, we will find some rational 


90 
estimates directly. We will start with a special case of the Euler-Kronecker formula: 


1 ie ek Bg En s 
(1— 274)(1 — 3-4)(1-—5-4)(1—7-4)--. 24 34 44 54 7 i 
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Now, we group the terms on the right by the powers of 2: 
< 1 1 1 les Slee ae ele ot oo 
>, kE 1 924 ' 34 44 ' 54° 64° 74 
k=1 


1 1 
ait (5: x 9 ; 


1 1 1 

FiA he S 

sees o 1 8 

EA BR ae 
8 


On the other hand, 


L] 1 E V 1 1 1 
ye ee T\ 34 44) \ 54 7 64 5 74 gi we 
k=1 


i 1 1 
Sipek ha 8e 
2 5 ( gt? ga ) 


1 1/1 1 1 SA 
=1+a talpra) -lta t) n 


k=2 
fl 1 1 15 
=1 det = 1+ 5G eb 
7 8 
We found that 
‘i =] 
8 1 15 
= i= = p 
7 s II | G) | is 14 
p prime 
so that 
14 i 7 
ta 21 pas — 
z<! I | o | DE 
p prime 
i.e. 


ssi Gle 


p prime 


So, the sought probability of four random integers having a common factor is 


1 1 
bet — and -. 
etween 7g and 3 
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Solution 3 
Amit Itagi 


1 
The probability that the it? prime p; divides an integer is —. Thus, the probability 
Di 


t 


1 
that a p; divides all four integers chosen is —, and the probability that p; is not a 
P; 


a 


1 
common factor of the four integers is 1 — —7. Hence, the desired probability is 
i 


” 1 1 90 
li 1— 1-— — =1-—~=1-— ~O. ; 
ta | II( a) (4) 1 tO 


i=l 


Probability of a Cube Ending with 11 


Solution 1 
If a? ends in the digits 11, then a? = 11 (mod 100). This is equivalent to the following 
two conditions: 


1. a? = 11 = 3 (mod 4) and 

2. a = 11 (mod 25). 

Easily, 3 is the only residue modulo 4 that satisfies a? = 3 (mod 4). 

Next, a? = 11 (mod 25) implies a? = 1(mod 5). Checking the residues, 1 is the 


only one that satisfies this condition. From here, suffice it to check 1, 6, 11, 16 and 21 
modulo 25: 


13 = 1 = 1 (mod 25) 

6° = 200 + 16 = 16 (mod 25) 

118 = 121-11 = 21 - 11 = 231 = 6 (mod 25) 

16° = 256 - 16 = 6-16 = 21 (mod 25) 

213 = 441 - 21 = 16-21 = 210 + 126 = 10 + 1 = 11 (mod 25). 


Thus, we are looking for an integer a such that a = 3 (mod 4) and a = 21 (mod 25). 
By the Chinese remainder theorem we find that the only number below 75 sat- 
isfying the two equations is 71. The only candidate for a = 21 (mod 25) below 
100 is 96 and that fails a = 3 (mod 4). It follows that 71 is the only solution to 
a? = 11 (mod 100). We conclude that the probability of randomly hitting on a solution 


1 
toa? = 11 d 100) is —. 
oa (mo ) is 100 


Solution 2 
Sriram Natarajan 


The number ends with 1. The form is (10x + 1)? or 1000? + 300x? + 30x + 1. 30x 
fixes the tenth place and x = 7 is the unique integer solution. So, 713 = 11 (mod 100) 
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and is the only number below 100 with this property. The probability of hitting it 


e aig 
mas oT 


Solution 3 
Jim Henegan 


Since z? = 11 (mod 100) implies z? = 1 (mod 10), we may begin by narrowing our 
attention to x € {1,11,21,...,91}. 
Now make the substitution a = x — 1. Then we must solve 


(a +1)? = 11 (mod 100) 
for a € {10,20,...,90}. Note that, for any such a, this congruence becomes 


a? + 3a” + 3a+1=11 (mod 100) => 


3a? + 3a = 10 (mod 100). 


Now make the substitution b = Te Then, for b € {1,2,...,9}, we must solve 


3(10b)? + 3(10b) = 10 (mod 100) —> 
30b = 10 (mod 100). 


By inspection, this yields b = 7, which gives a = 70, which gives x = 71. 


Solution 4 
Amit Itagi 


x? = 11 (mod 4) > x? = 3 (mod 4) = x =3 (mod 4) 
xr? = 11 (mod 5) > z? = 1 (mod 5) > x=1 (mod 5). 


The only numbers less than or equal to 100 that satisfy these two conditions 
belong to the set P = {11,31,51, 71,91}. These numbers are respectively 11, 6, 1, 
21, 16 (mod 25). The only case that satisfies x? = 11 (mod 25) is x = 21 (mod 25). 
Thus, x = 71 (mod 100). 

The required probability is 1 part in 100. 


Note 1 
Samuel Walters, Jim Henegan 


Interestingly, not every residue modulo 100 belongs to an integer cube. Being able to 
select only among those that do will increase the probability of hitting on the solution. 
Indeed, there are only 62 distinct residues modulo 100. 

There are 40 residues that are attained by unique numbers below 100. These are 
obtained by numbers ending with 1, 3, 7, 9. There are 20 residues with the second 
digit even, non-zero. These are attained by pairs of numbers with the difference of 
50 between them. E.g., 323 = 32768 and 82° = 551368. Finally, the ending of 25 is 
attained by 5, 25, 45, 65, 85 and the ending of 75 by 15, 35, 55, 75, 95. 

The first plot shows frequencies at which the last two digits of n? are x, for n € 
{1,2,...,100}. Note the two 5-tuples, 20 pairs and 40 singles. Here the two 5-tuples 
occur when x = 25 and x = 75. 
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The second plot shows the frequencies at which the last three digits of n? are z, 
for n € {1,2,...,999, 1000}. Here the four 25-tuples occur when xz = 125, £z = 375, 
x = 625 and x = 875. 


frequency 
5 u 


last two digits 
PON N eae OA ee ey ee O N O dodo 4 ah 


0 20 40 60 80 100 of n? 


frequency 
25r 


20} 


15F 


10F 


last three digits 
0 200 400 600 800 1000 of n° 


Note 2 
Alexandre Borovik 


Luckily, in this riddle we have a natural probability measure. Interestingly, for the 7", 
11t? and 13t? powers, the answer is the same. 


Illustration 
N.N. Taleb 


1 
Too’ one has to further assume that the support [0, n] is 
either very large or n is a multiple of 100. Now, if the sample space is [0, 150], the 


For the probability p to be 


1 2 
probability is , not ——. If the sample space is [0,171], p = ——. Hence, in general, 


150 100° 171 


n + 100. Lin mod 100] >71 — (n mod 100) 
100n , 


p= n > 100. 


Let us visualize: 


n + 100 Boole[Mod|m, 100] > 71] — Mod][n, 100] 
100n 
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DiscretePlot[{.01, p}, {n, 100, 1500, 1}, PlotRange — All, AxesLabel > 
{n, "Pr"}] 


Pr 
0.011 
0.010 
0.009 
0.008 


0.007 
200 400 600 800 1000 1200 1400 


DiscretePlot[{.01, p}, {n, 100, 10°, 1}, PlotRange — All, AxesLabel > {n,"Pr"}] 


0.010010 
0.010005 
0.010000 
0.009995 
0.009990 


0.009985 


n 
000 60000 80000 100000 


Odds and Chances in Horse Race Betting 


ai bi 


Let there be n horses in the race. If the better bets : ; 
ai+bi ay +d; 


for (i =1,2,...,n), 


against the it horse winning, then, if the kt? horse wins, the better wins ak + bk and 
loses 
iak +b 4 aitbi akt bk 


Therefore, the better’s winnings are 


Qk i b; bk = bi 
=l 
ak + bk ren te) ae 


i=l i=l 


which is positive and the same whichever horse wins. 
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Acting As a Team 


Solution 1 


3 
1 1 
Each player chooses a color randomly. The probability of success is P = (5) =. 


Solution 2 


There are eight possible combinations of hat colors. Each player has to choose between 
R(ed), B(lue) or P(ass), making 27 possible team answers. Each player makes a ran- 
dom choice between three possibilities. 

If all pass, the team loses. With two passes, there are six possible answers, each 


1 
with probability of 3 of a correct guess. With one pass, there are 12 possible answers, 
1 
each with probability of 1 of a correct guess. With no passes, there are eight possible 


1 
answers, each with probability of 3 of being right. The total comes to 


1 1 1 1 

P=— zI -64 v12 : 

a (0145-647 -12+ 5-8) 
343841 =7_7 1 


27 277 28B 4 


Solution 3 


Strategies 1 and 2 are stupid: the fewer people make a guess, the higher the proba- 
bility of success. However, in order to win, at least one guess ought to be made. The 


1 
probability of success is z 


Solution 4 
Elwin Berlekamp 


1. If you see two hats of the same color, guess the other color. 


2. If you see two hats of different colors, pass. 


The results of the strategy are summarized in the following table: 


ABC A B C Outcome 
R RR GBW GBW GBW lose 
R R B pass pass GBC win 
R B R pass GBC pass win 
R B B GRC pass pass win 
B R R GRC pass pass win 
B R B pass GRC pass win 
BBR pass pass GRC win 
BBB GRW GRW GRW lose 
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In the table, GBW = guess blue: wrong, GBC = guess blue: correct, GRW = 
guess red: wrong, GRC = guess red: correct. The strategy gives six team wins out of 
ight: ie 
ate. = 7: 

This remarkable strategy has roots in the coding theory, especially in the field of 


protective codes Chapter 6], |102| pp. 123-126]. 


Sum of Two Outcomes of Tossing Three Dice 


2 on the third toss could be the sum of the outcomes of the first tosses in one way: 
2 = 1 + 1; 3 in two ways: 2+ 1 = 3 and 1 + 2 = 3 and so on. In all there are 15 
combinations of three tosses that satisfy this condition: 


(1,1,2)) (1,2,3)! (1,3,4) (1,4,5) (1,5,6) 
(2,1,3)| | (2,2,4) 1(2,3,5)| | (2,4,6) 

(3,1,4) | (8,2,5)) (3,3,6) 

(4,1,5) | (4,2,6) 

(5, 1,6) 


The three tosses are independent, implying that all 15 outcomes in the table have 
equal probabilities. Those that include at least one 2 are boxed; there are eight of 


them. Thus, the sought probability is = 


Chess Players Truel 


The simplest way to approach the riddle is by constructing an event tree. We are going 
to have three of them. The trees grow away from the root, where the root (first game) 
is one of the pairs AB, BC or AC. Each node is a game, except for the terminal 
ones shown in double circles. These are the winners of the tournament who collected 
two wins first. For the first two trees we will find the probability of B being the 
winner. These will be denoted Pag and Pgc. We assume the outcomes of all games 
are independent. 
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The task is to compare the two expressions P4g and Pgc assuming p,q,r > .5 and 
r > p. In the following we will prove that, always, Pag < Pgo. The result is imminently 


plausible as it seems quite reasonable for B to start with playing the weakest player 
C first. 


So here are the two expressions to compare: 
1. Pag =p: (1-r)-q:(1-p)+(1-p) ol) > Og) are Cl p), 


2. Pao =q~(1—p) +¢-p-(l—r) «q+ (1—@):r-U—p)-@. 


In the difference Pgc — Pap, the common term (1 — p) - q cancels out, leading to 
Pgc — Pag = [p-q: (1-r)+(1-p)- 0-4): r]:(@+4-1) 


which, since p + q > .5 +.5 = 1, is positive. Note that this is true even without the 
condition r > p. 


Two Loaded Dice 


Solution 1 


We want to show that, if pz, is the probability of getting k on top, k = 1,...,6, then 


1 
pi tps +p3+pi+ps +p > 5 
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What we know is that pı +---+ pg =1. 
By the Cauchy—Schwarz inequality p7, 


1 = (pı + p2 + ps + pa + ps + pe)? 
=(1-pı+1-p2+1-p3+1-p4+1-ps+1- pe)? 
< (1? +1? +1? +1? +1? +1?) (p? + p2 +p? +p? +p? +p?) 
and the required inequality follows. 


Solution 2 

Kriteesh Parashar 

Let {p1, p2, p3,---, Pe} be the probability of each number on the die for both dice. Then 
pi + p2 +--+: +pe6 = 1, and P(doubles) = p? + p3 +--+ + pê. By Jensen’s inequality 
[i7], since x? is convex, 


Pidoubles) 


ua i =) 


a G ) 
t 
6` 


| v 


P(doubles) = 


Solution 3 
Amit Itagi 


Let p; be the probability that a die turns up i: 


1- (5n) Ee) = Dat? D prn 


DK JAK 


<Sopi+ X (pi +z) = RFD ee 
i j,k, jżk 
1 


Thus, th bability of a double i ASi: 
us, the probability of a dou cis Une 5 


Solution 4 
Jacob Janssen 


You can as well throw one die twice. Let py be the probability of getting x with one 


die. Then > p> is minimal when the largest p2 is minimal as x? convex, which is when 


T 


1 1 
z = z for all z. So > te 2 g: 
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Solution 5 
Tan on Twitter 


111111\" 
Let = ( 5,555,555 ) . Note that Tē = 


1 ee 
6’6’6'6'6'G ai ike a 


Notes 

For n > 1 there are 6” equiprobable outcomes of which six are n-tuples of equal tops. 
Thus, the probability—in case of fair dice—of all dice showing the same number is 
6 1 


6" = 6-1 i 
For loaded dice with the same probability distribution, we may invoke Hölder’s 
inequality b]. With k = 2 ; 
n-—1 


or 
6 
iz 67-1 ; (Se) , 
i=l 
and, finally, 
6 
Dor 2 67-1 
{=l 


Crossing Bridge in Crowds 
G.D. Kaye 


If there is nobody on the bridge at noon, no one has entered it in the five-minute interval 
before noon. Since there are 144 intervals of 5 minutes in 12 hours, the probability that 


1 
an individual enters the bridge in a specific one is I The probability that none of 


the 1000 individuals enters in that interval is 


1000 
1 1000 1 144] T44 a 
1- — = 1- — ze T 0. í 
( m) ( m) e 18 ~ 0.00096 


Chapter 5 


Geometric Probability 


Whatever way uncertainty is approached, probability is the only sound way 
to think about it. 


Dennis Lindley, 2006, Understanding Uncertainty 


Random events that take place in continuous sample space may invoke geometric 
imagery for at least two reasons: the nature of the problem or the nature of the solution. 

Problems that fall under the category of Geometric Probabilities mostly relate 
geometric measurements of one set to those of another. Sometimes the measurements 
come as a result of advanced calculations, but often comparison of the measurements 
submits to reasonably educated intuition. 


Riddles 


5.1 Three Numbers 
Mathcount 2018 


What is the probability that three randomly drawn real numbers between 0 and 1 have 
a sum less than 1? 
Express your answer as a common fraction. 


5.2 Three Points on a Circle | 
Problem 244], Problem 3.12] 


Three points A, B, C are placed at random on a circle of radius 1. What is the 
probability for A ABC to be acute? 


5.3 Three Points on a Circle Il 

Problem A6 (1992)| 

Three points are randomly chosen on a circle. What is the probability that the triangle 
with vertices at the three points has the center of the circle in its interior? 


5.4 Two Friends Meeting 


Two friends who take the metro to their jobs from the same station arrive to the station 
uniformly randomly between 7 and 7:20 in the morning. They are willing to wait for 
one another for 5 minutes, after which they take a train whether together or alone. 
What is the probability of their meeting at the station? 


74 Cut the Knot: Probability Riddles 


5.5 Hitting a Dart Board 
Mathcount 2018 


Setzer throws a dart that lands within one of the 24 numbered regions on the dartboard 
shown and then rolls a standard six-sided die. 


What is the probability that the number of the region his dart hits is the same 
as the number he rolls on the die? What is the most likely coincidence? Express your 
answers as common fractions. 

Assume that the probability of hitting a region on the dartboard is proportional 
to its area and that the small circle and the rings have the radii 1, 2, 3 and 4. 


5.6 Circle Coverage 


Given a circle and n of its semicircles chosen at random with no chosen semicircles 
sharing their endpoints, what is the probability that the circle is completely covered 
by the semicircles? 


5.7 Points in a Semicircle 


Problem 10] 


Choose n points at random (uniformly and independently) on the circumference of a 
circle. Find the probability p, that all the points lie on a semicircle. (For instance, 
pi = p2 = 1.) 

More generally, fix ¢, 0 < ¢ < 27, and find the probability that the n points lie on 
an arc subtending an angle ¢. 


5.8 Flat Probabilities on a Sphere 
Problem 305], Randy Schwartz of Schoolcraft College, MN 


You are to select two points on a sphere and offer a friend to select a third one. If the 
triangle (plane) so obtained is acute, you win; otherwise, you lose. 
Which selection of the first two points maximizes your chances of winning? 


5.9 Four Random Points on a Sphere 
Problem A6 (1992)] 


Four points are chosen at random on the surface of a sphere. What is the probability 
that the center of the sphere lies inside the tetrahedron whose vertices are at the four 
points? (It is understood that each point is independently chosen relative to a uniform 
distribution on the sphere.) 
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5.10 Random Numbers and Obtuse Triangle 
Chapter 1] 


Two numbers « and y are uniformly distributed on [0,1]. What is the probability that 
sides of length x, length y and length 1 form an obtuse triangle? 


5.11 Random Intervals with One Dominant 
N.N. Taleb, as a follow-up to Riddle [3.6] on page [16] 


2n > 4 points are chosen uniformly randomly on the interval [0, 1], say, X1, X2,..., Xen 
and, for 1 < j < n, let J; be the closed interval with endpoints X2;_1 and X2,;. Find 
the probability that one of the intervals J; is dominant, i.e., includes all other intervals 
as subsets. 


5.12 Distributing Balls of Two Colors in Two Bags 
Problem 2.3.9] 


10 black and 10 white balls are placed into two bags. A bag is chosen at random and 
a ball is drawn from that bag. 

How should one distribute the balls between the bags as to maximize the probability 
of drawing a white ball? 


5.13 Hemisphere Coverage 


Section 5.10] 


Given a sphere and n of its hemispheres chosen at random, what is the probability 
that the sphere is completely covered by the hemispheres? 


5.14 Random Points on a Segment 


Problems 42-43] 


1. A point is dropped randomly on a unit segment. It divides the segment into 
two pieces. What is the average length of the shortest piece? The longest one? 
Their ratio? What is the expected value of the ratio of the lengths of the longest 
segment to the shortest one? 


2. Two random points, chosen uniformly and independently on a unit segment, split 
the segment into three pieces. What is the average length of the left segment? 


3. Two points are dropped randomly on a unit segment. They divide the segment 
into three pieces. What is the average length of the shortest piece? The longest 
one? Their ratio? 


5.15 Probability of First Digit in Product 
Problem 165] 


Two d-digit numbers (whose first digits are not zero) are chosen randomly and inde- 
pendently, then multiplied one by the other. 


1. Let P4 be the probability that the first digit of the product is 9. Find jim Pa. 
— oo 


2. More generally, if n 4 0 is a decimal digit and P? is the probability that the first 
digit of the product is n, what is lim P}? 


d— oo 
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5.16 Birds on a Wire 


Mark Galecki, Marcin Kuczma 


Take a wire stretched between two posts, and have a large number of birds land on it 
at random. Take a bucket of yellow paint, and for each bird, paint the interval from it 
to its closest neighbor. The question is: what proportion of the wire will be painted? 
More strictly: as the number of birds goes to infinity, what is the limit of the expected 
value of the proportion of painted wire, assuming a uniform probability distribution 
of birds on the wire? 


5.17 Lucky Times at a Moscow Math Olympiad 
Problems A4 and A5] 


1. A moment of time is lucky if the hour, the minute and the second hands are in 
this order clockwise starting with the hour hand. A moment is unlucky if the 
order of the hands clockwise is hour, second, minute. Which is more probable? 


2. A moment of time is lucky if the hour, the minute and the second hands all lie on 
the same side of some diameter of the clock face and unlucky otherwise. Which 
is more probable? 


5.18 Probability of a Random Inequality 
Problem 1985-7] 


Let a and b be positive constants with b > 1. Given that x + y = 2a and that x is 
uniformly distributed on [0, 2a], find the probability that 


(b? — 1)a? 


ry > p2 


5.19 Points on a Square Grid 
A. Bogomolny 


n points are chosen randomly at the nodes of a square grid. The points are joined 
pairwise. What is the probability that at least one of the midpoints of the so-obtained 
segments lies at a grid node? Consider n = 2,3,4,5. 


5.20 A Triangle out of Three Broken Sticks 
[103] Problem 96] 


Three sticks of equal length are each uniformly randomly broken into two pieces. 


1. What is the probability that from the three left pieces of the three sticks it is 
possible to form a triangle? 


2. What is the probability that from the three longest pieces of the three sticks it 
is possible to form a triangle? 


3. What is the probability that from the three shortest pieces of the three sticks it 
is possible to form a triangle? 


1 
4. What if the pieces are chosen randomly with the probability of z for each stick? 


5. What is the expected number of triangles that can be so formed? 
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5.21 Probability in Dart Throwing 
Problem 1999-5] 


What is the probability that a dart, hitting a square board at random, lands nearer 
the center than an edge? 


5.22 Probability in Triangle 
A. Bogomolny 


Point P is uniformly distributed in A ABC. AA’, BB’, CC’ are the cevians through P. 
K, L, M are the midpoints of AA’, BB’, CC’, respectively, Figure [5.1] 
Find the probability that P is within A KLM. 


B A’ 
Figure 5.1: Probability in triangle. 
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Solutions 


Three Numbers 


Solution 1 
A. Bogomolny 


The shape described by 0 < z,y,z < land «+y+z < 1isa pyramid with a triangular 
base defined by three points, (0,0,0), (0,1,0) and (0,0,1), and the apex at (0,0,1), see 
Figure 

1 


1 1 
The volume of that pyramid is 37 1. (5 ‘1. 1) =y This is also the volume of 


the pyramid relative to that of the cube, which is 1. 


Figure 5.2: Three numbers sum to less than 1. 


Solution 2 
N.N. Taleb 


The probability density function of the uniform distribution on [0, 1] is 


fa) = t x € [0,1] 


elsewhere. 


Given two random variables, the probability density function of their sum is the 
convolution of their probability density functions. It is the same for three: 


s= f te-v( [7 Itu- od) ay 


| x 
S 
A 
x 
A 
= 


(—3 + 6z — 222), 1<z<2 


II 
NI=NI= wv 


(9 -—6z+ 27), 2<2<3 


elsewhere. 


© 
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Due to the condition x + y +z < 1, we are only concerned with the distribution 


1.2 
over [0,1] such that the sought probability can be found from Fade =-. 
0 


Solution 3 
Thamizh Kudimagan 


I recast the riddle as follows to help me visualize it better: if a lizard going up a wall 
takes three consecutive steps, each of with a random value between 0 and 1 unit of 
length, what is the probability it will end up traveling less than 1 unit of length? 

The lizard’s first step has a box car probability distribution function between 0 and 
1. For each of the values of its first step, you can add the second step. The second step 
by itself has a similar box car probability distribution function. 

The sum of the first two steps has a probability distribution function that can be 
visualized by sliding the box car probability density function of the second over that 
of the first and noting the overlapping area, which increases linearly until 1 and then 
decreases linearly until 2—basically convolution. 

The probability density function of the sum of the three steps (again, convolution) 
can be visualized by sliding the box car probability density function of the third step 
on the linear probability density function of the sum of the first two steps and noting 

2 
how the overlap area evolves. Its area is under = for 0 < x < 1, so the answer is A 


To elaborate on the last part, the probability density function for the sum of the 

three steps (let us call it x for now) is roughly bell-shaped between 0 and 3 but is just 
2 2 

S in the range 0 < x < 1. Thus, P(x < 1) is the area under > for 0 < x < 1, so the 


answer is e 


Three Points on a Circle | 


Solution 1 

Fix point C. The positions of points A and B are then defined by arcs a and 6 extending 
from C in two directions. A priori we know that 0 < a + < 2r. The values of a and 
b favorable for our situation (as subtending acute angles satisfy) are 0 < a < m and 
0 < B < v. Their sum could not be less than m as this would make angle C obtuse. 
Therefore, a+ 8 > a. The situation is presented in Figure [5.3] where the square has 
the side 27. 


Figure 5.3: A region that yields an acute triangle. 
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Region D is the intersection of three half-planes: 0 < a, 0 < 6 and a+ < 2z. This 
is the big triangle in Figure The favorable events belong to the shaded triangle 
which is the intersection of the half-planes a < 7, 8 < m and œ + p > 1. The ratio of 


1 
the areas of the two is obviously x 


Now observe that, unless the random triangle is acute, it can be thought of as 
obtuse since the probability of two of the three points A, B, C forming a diameter is 
0. (For BC to be a diameter, one should have a + 8 = 7, which is a straight line, with 
0 as the only possible assignment of area.) Thus, we can say that the probability of 


3 
A ABC being obtuse is T For an obtuse triangle, the circle can be divided into two 


halves with the triangle lying entirely in one of the halves. It follows that 5 is the 


answer to the following question: 
Three points A, B, C are placed at random on a circle of radius 1. What is the 
probability that all three lie in a semicircle? 


Solution 2 
credits V.V. Fok and Yu.V. Chekanov with the following argument 


We shall select a triangle in two steps. First, we get three points A, B, C and their 
antipodes A’, B’, C’ across the center of the circle (Figure [5.4p. Selecting a triangle 
consists of selecting one of the points in each of the three pairs A, A’, B,B’, C, Œ. 
There are eight possible combinations. 


Figure 5.4: How to find an obtuse triangle. 


With each of the six points, we associate the semicircle around its antipode. Three 
points form an obtuse triangle if and only if they lie in a semicircle, and this is only 
if their associated semicircles have a nonempty intersection. The diameters of the 
associated semicircles form six sectors. Each sector serves as an intersection of the 
associated semicircles for exactly one selection of the vertices, meaning that out of 
eight possible selections there are always six obtuse (and two acute) triangles. The 


probability that one is obtuse is therefore : = - 
1 
It follows that the probability of a triangle being acute is 1 — : =y 
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Three Points on a Circle II 


Solution 1 
A. Bogomolny 


We shall reformulate the riddle and then refer to another one, Riddle [5.6]on page 
For three uniformly random points on a circle these two statements are equivalent: 


1. The semicircles of which the three points are the centers cover the circle. 


2. The triangle with vertices at the three points has the center of the circle in its 
interior. 


Indeed, consider a borderline case where two of the points are collinear with the 
center of the circle. Denote the points A, B, C and the center of the circle O (Figure[5.5). 
Assume that, say AC, is a diameter of the circle. 


> 4 


Figure 5.5: A triangle inscribed in a circle may or may not include the center. 


Point B is on one of the semicircles defined by AC. Let D be the midpoint of the 
semicircle defined by AC that does not contain B. Then D is outside the semicircle 
defined by B. Shifting A and/or C toward B will leave D exposed, not covered by any 
of the three semicircles. Simultaneously, the center O will drop out of A ABC. 

Now, with reference to the problem of covering a circle with three semicircles, the 


1 
probability of having O in the interior of A ABC equals 7 


Note 


The two statements are also equivalent to Riddle [5.3]on page [73]about the probability 
of a triangle with vertices at the three points having the center of the circle in its 
interior. 


Solution 2 
A. Bogomolny 


The center O is inside A ABC if and only if that is an acute triangle, which means that 
all three central angles AOB, BOC, COA are less than 180° (Figure |5.6). It follows 
that their semicircles intersect pairwise and thus cover the whole of the circle. 

Now, with reference to the problem of circle coverage with three semicircles, Rid- 


1 
dle bolon page the probability of having O in the interior of A ABC equals T 
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Figure 5.6: Three semicircles that completely cover the circle. 


Solution 3 
Achraf on Twitter 


Let us say that O is the first point we generate. This will be our point of reference. From 
this point, we will use the polar coordinate system (counterclockwise) to determine the 


position of A (our second point, with OCA = a) and then the position of B (our third 


point, with OCB = B). Let C be the center of the circle. 
Let us call A and B the two random variables (uniform on [0, 27]). Let us distinguish 
between two cases (there are two letters B in each graph for the two “extreme” letters 


B, see Figure [5.7): 


1. If0 < a < 1, then the 8 that allow C to be in the triangle are those in [7,7 +a]. 


2. If <a < 2r, then the £ that allow C to be in the triangle are those in [a—7, 7]. 


A 


CASE 1 CASE 2 


Figure 5.7: Two random points on a circle. 
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The probability p that C is inside the triangle OAB is (with convenient notations 
for probability density functions): 


(where we used the change of variable 6 = 27 — a). 


Two Friends Meeting 


In a Cartesian system of coordinates (s,t), a square of side 20 (minutes) represents all 
the possibilities of the morning arrivals of the two friends at the metro station. 

The gray area A, Figure[5.3| is bounded by two straight lines, = s+5 andt = s—5, 
so that inside A, |s — t| < 5. It follows that the two friends will meet only provided 
their arrivals s and ¢ fall into region A. The probability of this happening is given by 
the ratio of the area of A to the area of the square: 


15 15 
wo- (15x $ +15x 5) 175 7 
400 400 16 


Figure 5.8: Two friends meeting. 
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Hitting a Dart Board 
Mathcount 2018 


With a reference to Figure 5.9] if the probability of hitting a region is proportional to 
its area, then the odds of hitting one in the outer ring to those of hitting one in the 
middle ring to those of hitting one in the central circle are 5:3: 1. 


Figure 5.9: Probabilities on a dart board. 


1 14 
Summing up the probabilities for each number, we get P(1) = = (2) = 79" 
12 14 10 10 


As the die is fair, the probability of coincidence is proportional to the probability 
of hitting a number on the dartboard. 2 and 4 have the highest probabilities of being 
hit so that they are most likely to hit the outcome of the die roll. 

For the first question, the distribution on the dartboard is inconsequential. Once 


1 
a number is hit, there is the same probability of 6 that it will be matched by the 


outcome of a die roll. 


Circle Coverage 


Each semicircle is bounded by two diametrically opposite points. The n pairs of points 
divide the circle into 2n regions (which can be proved by induction.) A point on the 
circle may belong to a given semicircle or to its complement. Hence, the probability 
that a given point on the circle is in one of the semicircles is 2~'. The probability that 
it is covered by none of the n semicircles is 2~”. Further, a point belongs to exactly one 
of the 2n regions. If the point is not covered by the semicircles, neither is the region it 
belongs to. Thus, the probability that at least one of those regions is not covered by 
the semicircles is 2~"(2n) = 2~"*!n. The probability that the entire circle is covered 
is then 1 —27-"*!n. 

For n = 1, the formula gives the probability of 0, which is obviously true because a 
single semicircle only covers half a circle, not the whole of it. For n = 2, the probability 
is also 0, although two semicircles with the same endpoints cover the entire circle. The 


1 
probability of this is clearly 0. For n = 3, the formula gives the probability T 
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Points in a Semicircle 


Solution 1 to Question 1 


Associate with every point X on a circle the semicircle arc of which X is the midpoint. 
We claim that a set of points ¥ on a circle lie in a semicircle only if the union of the 
associated semicircles does not completely cover the circle. 

Let (C) be the circumference of a circle (to avoid ambiguity); ¥ = {Xp : Xp € 
(C), k=1,...,n}, n > 1 and is an integer. The following two statements are equiva- 
lent: 


1. There is point P € (C) such that ¥ C P. 
2. (C)\L (Xe #0). 
k=1 


Thus, the original problem is equivalent to Riddle [5.6]on page [74] where the prob- 
ability was found to be 1 — 2~"*'n for the semicircles to cover (C) completely. This 
implies that for a set ¥ of n random points on (C) the probability of them all being 


covered by a semicircle is nat" 


Solution 2 to Question 1 
Joshua Jordan 
Case 1 


nis 1 or 2. 
By inspection, the answer is 1. 


Case 2 


n> 2. 

Let P be any set of n > 2 points distributed uniformly at random on the circumfer- 
ence of a circle. With probability 1, the n points are distinct. For any point p € P, let 
S(p) be the semicircle starting at p (inclusive) and continuing in the counterclockwise 
direction. If all n points lie on the same semicircle, then there exists a point p € P 
such that P C S(p). 

Choose a point p € P arbitrarily, and let q be any one of the other n — 1 points. 


1 
The probability that q € S(p) is 5 Therefore, the probability that S(p) contains all 
1 n—1 
the other n— 1 points is (5) . Since, by definition, S(p) contains p, the probability 


1 n-1 
that S(p) contains all n points, that is that P C S(p), is also (3) i 


When n > 2, there is at most one point p € P such that P C S(p). Therefore, we 
can sum the individual probabilities for each point p € P to obtain the probability 


n—1 
that there exists any p € P such that P C S(p). This sum is n (5) . Therefore, 


1 n—-1 
the probability that all n > 2 points lie on the same semicircle is n (5) ; 
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Combining Cases 


1 n-1 
Note that n (5) = 1 when n is either 1 or 2. That is the same answer obtained in 


case 1. Therefore, the expression from case 2 gives the correct result for any positive 


n-1 
integer n, and the probability that all n points lie on the same semicircle is n (5) i 


Solution to Question 2 
Follows Josh Jordan’s reasoning 


We make an assumption that ¢ < 7. 
For a point X € (C), let &(X) be the arc subtending angle ¢ that extends coun- 
terclockwise from X, inclusive of X. The probability of a point uniformly distributed 


on (C) to fall into ®(X) equals 7 
T 
Given n > 2 points X1, X2,..., Xn, the probability that X; € ®(X;,), i Æ k, is the 
n—i 
same 2 and, for all i Æ k, is ba : 
20 2m 


For different k, the events Ep: (Vi 4 k)X; € ®(X;) do not intersect, since X; € 
(xp) is incompatible with Xp € 6(X;). It follows that the probability of the event 


n n-1 
5 Ex equals n (=) ; 
2T 


k=1 
Now, if for some point Y, (Vi) X; € ®(Y), then, for X; nearest to Y in the counter- 
clockwise direction, (Vi Æ k)X; € ®(X;,). As we found, this event has the probability 


n-1 
of n () ; 
2T 


Flat Probabilities on a Sphere 


Suppose you choose two points with central angle a. By symmetry, we may assume 
they have coordinates (0, b,c) and (0, b,—c). Figure shows a 2D cross section. 

You lose your bet if your friend selects a point in one of three caps: above the plane 
z = c, below z = —c or to the right of y = b. 


Figure 5.10: A 2D cross section. 
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Thus, we need to compute the areas of the three spherical caps shown in Figure[5.1]| 
We do that using a theorem discovered by Archimedes: 

Note that if two parallel planes h units apart slice a sphere of radius r, the surface 
area between the planes is 27rh. 

Using these calculations, we conclude that the probability that you win the bet is 


a 1 a 
= si 1j: 
pla) sin (5) ws (cos (5) 
The function attains its maximum for a, = 2arctan(2) ~ 126.9°. In addition, 


1 V5-1 


plao) = Y = 2 ` 


Steve Phelps created a beautiful dynamic illustration for the riddle that is the 
source for Figure 


Figure 5.11: Flat probabilities on a sphere: a 3D view. 


Four Random Points on a Sphere 


Solution 1 
Problem A-6] 


Four points A, B, C, D define a system of barycentric coordinates in the space such that 
every point E can be uniquely expressed as a linear combination with real coefficients 


B=wA+2B+yC+2zD, wtetyt+z=1. 


Point E lies in the interior of the tetrahedron ABCD if and only if all the coefficients 
are positive. Thus, if O stands for the origin and serves as the center of the sphere, the 
riddle inquires of the probability that O = wA+2B+yC+2zD, with w+a+y+z=1 
and w,x,y,z > 0. 

Let us fix one of the points and denote it P. The other three points will be denoted 
P1, P2, P3 and will each be selected in two steps. First we select three random diameters 
Qi + Qi2, i = 1,2,3. Next, we randomly pick one of the endpoints of each of the three 
diameters and take them for P1, P2, P3, respectively. There are 23 = 8 ways to perform 
the second step. The three diameters define eight tetrahedra. Observe that since, for i = 
1, 2,3, Qi1+Qi2 = 20, the coefficients in O = wP+aP,+yP2+2zP3 change signs every 
time one of the P’s is replaced with another endpoint of the corresponding diameter. 
It follows that, disregarding the cases of zero probability, like that one of the faces of 
the tetrahedron passes through the origin, only for one out of eight tetrahedra do all 
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the coefficients have the same sign and can be chosen positive. Thus, the probability 


1 
of including the origin is 3 for any three random diameters. It follows that the sought 


1 
probability is 5 


Solution 2 
A. Bogomolny 


Let us associate with every point X on the sphere S the semisphere X that has that 
point as its center. We claim the following: 

A tetrahedron ABCD inscribed into a sphere contains the center of the sphere in 
its interior if and only if the four hemispheres P1, P2, P3, P4 cover the sphere entirely. 

Let O be the center of the sphere and, for a point X on the sphere, let X be the 
half-space bounded by a plane through O that does not include X. If point P on the 
sphere is not covered by any of the four semispheres P;, i = 1,2,3,4, it lies in the 
interior of the four-sided cone with the apex at O: P € NL, P;. This means that all 
the angles POP;, i = 1,2,3,4, are obtuse, implying that P; € P, i = 1,2,3,4. In other 
words, the tetrahedron P;P2P3P, C P and, therefore, does not contain O. 

This means that if O € P;P2P3Py, then S C ea P;. We need to prove the converse. 

Assume O ¢ P;P2P3P,4. There exists a plane 7 that separates P1, P2, P3, P4 and 
O. The perpendicular from O to z crosses the sphere at two points; let P be the farthest 
of the two from r. Then all angles POP; are obtuse so that P ¢ UÎP:. 


1 
The answer 3 is now retrieved from the formula 1 — 2” (n? — n + 2), with n = 4, as 


the probability of n hemispheres covering the whole of the sphere. 


Random Numbers and Obtuse Triangle 
For x,y, 1 to form a triangle at all, one needs these three inequalities: 
l+a>y, l+y>z, t+y>l. 


The first two inequalities are satisfied automatically. Thus, we should only consider 
the last one, z +y > 1. 

Being the side of length 1 is the longest, the angle that lies opposite this side is 
the largest. It should not be acute or right, although the latter possibility has the 
probability of zero and can be disregarded. 

The angle in question, say a, is obtuse, cos& < 0, so that, from the law of cosines, 
£? +y? < 1. Graphically, the point (x,y) that satisfies x + y > 1 and z? +y? < 1 lies 
in the blue region shown in Figure 

The sought probability is the area of that region: 

1 r-2 


2 4 


T 
4 
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(0,1) 


(0.0) (1,0) 


Figure 5.12: Random numbers and obtuse triangle. 


Random Intervals with One Dominant 


Solution 1 
A. Bogomolny 


Our sample space consists of (2n)! permutations of the given point indices. An 
interval J; is dominant in 2(2n — 2)! cases, as the 2n — 2 indices can be arbi- 
trarily permuted between the two extremes. One of the intervals is dominant in 
n+ 2(2n — 2)! = 2n(2n — 2)! cases. Thus, the probability of there being a dominant 


i . 2n(2n — 2)! 1 
interval is = ; 
(2n)! 2n— 1 
Solution 2 
N.N. Taleb 
The riddle is reduced to finding adjacent Xmax and Xmin (since only the interval 
[Xmax; Xmin] covers all other intervals) in permutations of X1,..., Xən- 


Note that, given that 2j is even, to be compatible with the previous formulation of 
the riddle, we consider intervals [X1, X2] and[X3, X4] but not [X2, X3]. 


e Xmax followed by Xmin 


There are (2n — 2)! sequences of each 


Cars X mins X3, X4, cee , Xan) 
(Xi, Xa, Xmax; Xinin mais , Xan) 


lines 
(X1, Xo, dic is Hae max) Xmin) 
for a total of n(2n — 2)! 


e Xmin followed by Xmax 


By mirroring, same n(2n — 2)! 
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e Thus, 


Numerator = 2n(2n — 2)! 


Denominator(total permutations) = (2n)! 


1 


ee 


Thanks to Michael Wiener for correcting the count of the number of lines. 


Solution 3 
Michael Weiner 


The easier path is to consider that Xmin is equally likely to be paired with any of the 


other 2n — 1 values. Only 1 is Xmax- 


Solution 4 
Zhuo Xi 


1 
Locate the point on the left (minimum). It has 5 chance to be odd, with 


2n 


=l 


1 
probability that the next index point is maximum. Similarly, it has 3 chance to be 


1 
ith 
even with 5 


I chance the previous index point is maximum. 


Distributing Balls of Two Colors in Two Bags 


Let W1, W2 stand for the number of white balls placed into two bags; similarly, let 
Bı, Bz stand for the number of black balls. If P(W) is the probability of drawing a 


white ball, we have several examples: 


9 and Bz = 10. 


W, | Bi | We | B> P(W) 
1 1 1 1 1 
5} 5 5] 5 rae aa 
1 1 
10| 0 0| 10|}=-14+=-0==— 
2 i 2 
1 3 L. 7 1 
3) 7) 7) 8\5°m97 2 i032 
3| 3 7| 7 : ie ave 
22°22 2 
The best way to find the solution is to set W; = 1, Bı = 0, W2 
Then 
1 1 9 14 
BW) S55 ag = qq Ta 
1 
Figure 5.13 shows a graph of the function f(x,y) = ( E | 
2 \ 2 +y 


the region (0,10) x (0, 10). 
Here x corresponds to W1, y to By, etc. 
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For integer values, the maximum of f is attained for f(1,0) and f(9,10) which are 
practically the same distribution. 

The maximum probability could not exceed 75% because there is always a bag 
where at most 50% of the balls are white and another where at most 100% of the balls 
are white. 


Figure 5.13: Distributing balls of two colors in two bags. 


Notes 


1 10 — 
In Figure the function f(x,y) = 5 ( i + 50 7 ) appears to max out at 
x+y —a-y 


the corners (0,0) and (10,10). This is probably due to software interpretation of one of 


1 
the values, f(0,0), f(10, 10), as 5 the other as 1. From the viewpoint of probabilities, 
neither makes sense. Drawing a ball from an empty bag could not be assigned either 


1 
3 or 1 as a probability. 


Hemisphere Coverage 


For a start, let us observe that some configurations, like when, for example, three or 
more of the hemisphere-bounding great circles concur, may not fall in line with general 
reasoning, but such configurations occur with probability 0 and may be ignored. 

Each hemisphere is completely bounded by a great circle. The n great circles divide 
the surface of the sphere into n? — n + 2 regions (which can be proved by induction). 
Each great circle bounds two hemispheres. Hence, the probability that a given point, 
not on the great circle, is in one of the hemispheres is 271. The probability that it is 
covered by none of the n hemispheres is 27”. The probability that the point lies on 
one of the great circles is 0. Other than that, a point belongs to exactly one of the 
n? — n +2 regions. If the point is not covered by the hemispheres, neither is the region 
it belongs to. Thus, the probability that at least one of those regions is not covered by 
the hemispheres is 2~"(n? — n + 2). The probability that the entire sphere is covered 
is then 1 — 2-"(n? — n + 2). 

For n = 1, the formula gives the probability of 0, which is obviously true because 
a single hemisphere only covers half a sphere, not the whole of it. For n = 2, the 
probability is also 0, although two hemispheres, if bounded by the same great circle, 
cover the entire sphere. The probability of this is the same as that of having a random 
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point on the sphere fall on a given great circle, which is 0. For n = 3, the formula again 
gives zero probability. 


Random Points on a Segment 


Solution to Question 1 


1 
The expected value of each of the pieces, due to the symmetry principle, is 7 It is not 


so for the expected value of the shorter of the two. (This is a curiosity because one of 
the two is surely shorter!) 


1 
If the point has landed on number t, then, provided t < z the left piece is the 
shortest; otherwise, the shortest piece is on the right. Assuming the shortest piece is 


1 1 
the left one, its maximum length is z and the average is half that, i.e., T The same 


1 3 
number is found if t > z Then the average length of the longer piece is r and the 


ratio of the two is 1 : 3. This is the ratio of the expected values. The expected value 
of the ratio is different. 


1 
Assume t > z Then ¢ is the length of the longer piece; 1 — t is the length of 
1 
the shorter one. Since t is distributed uniformly on E 7 , the expected value of the 


ratio is 


1 = 
2 | = paola) 
1/2 t 


1 
= 2ln 2 — 1 #2 0.386. 
1/2 


The expected value of the ratio of the longest segment to the shortest one is infinite 
1 


for it is equal to the divergent integral 2 | dt. 


yj2l—t 


Solution 1 to Question 2 
Let L be the expected value of the length of the first piece. Since, according to the 
symmetry principle (Appendix|B}, the lengths of all three segments have the same dis- 
tribution, their lengths have the same expected value, namely, L. However, in any 
experiment, the lengths of the three pieces add up to 1. It follows that 3L = 1 
Į 
and L = -. 
3 
Solution 2 to Question 2 
Joshua Bowman 


Let the two points have coordinates x and y. Then the expected value of the middle 
segment is 


1 
J |z — y|dxdy = 3° 


[0,1] x [0,1] 


1 
By symmetry (Appendix, the expected values of the other two segments are also 7 
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Solution to Question 3 
As in question 1, the expected value of each of the pieces, due to the symmetry prin- 


1 
ciple, is = However, if we talk of the expected values of the shortest, middle and the 


longest segments, then all three numbers are different. 

I will use geometric insight. Assume that the left point is a distance X from the 
left end and the right point, a distance Y, such that X < Y. We can think of the pair 
(X,Y) as being uniformly distributed in the unit square, with the condition X < Y 
satisfied by the points above the main diagonal. 

The task is to compare the three lengths: X, Y — X and 1—Y. If X is the least 
one, then X < Y — X and X < 1- Y, i.e., 2X < Y and X +Y < 1. The area where 
the two conditions are satisfied is the dark triangle shown in Figure [5.14 


GD 


Figure 5.14: Random points on a segment 1. 


The average value of X is attained at the center of gravity of the triangle, that is 


1 1 1 1 
3 way from the base such that it is equal to aa 

Points (X,Y) that correspond to the case where X is the largest length lie in the 
quadrilateral marked in Figure 


1 


Figure 5.15: Random points on a segment 2. 
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To find the average value of X in the quadrilateral, it is split into two triangles. 


1 1 1 1 
For the right triangle, the average value is 5 + z for the left one, it is 37 Ig The 
areas of the triangles are in proportion 3 : 1, so that the average value of X in the 
quadrilateral is the weighted sum: 


1 1 1 1 4 8 
: | L1. er E 
i G 5) i G 5) AT Oou 


3+1 4 18° 
For the case where X is the middle length, we can shorten the computation by 


1 2 
subtracting from the total of 1 the two fractions, 9- Ts an 
follows that the expected lengths of the shortest to the middle to the longest segments 
are in proportion 2:5: 11. 


11 ee) 
d 18? to obtain I5 It 


Probability of First Digit in Product 


Solution 1 

We shall represent integers with d digits as N = m - 10%}, with 1 < m < 10. Then 
for two integers, Ni = mı: 10%! and M = mə. 10%7t, Ni No = mmo. 1024-2, 
with 1 < mmo < 100. The first digit of the product is n, provided Mmımə € 
[n,n + 1)U[10n, 10(n + 1)). 

The totality of the products m mz fills proportionally the bands defined by the 
hyperbolas zy = n and xy = 10n in the interior of the square (1,1) — (10,10). Thus, 
the sought limit probabilities (that I will denote P”) are the areas of those bands 
relative to the area of the square, which is 81, Figure 


zy =90 


Figure 5.16: Probability of first digit in product. 
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For n = 9, the calculations differ somewhat from the rest of the digits: 
97/4 10/4 10 
a= |f (2 o) a= f = 1) ae} + f (0-2) ae 

1 T £ 9 T 9 T 

9 10 

1 1 
e ee 
1 2 9 x T 


= In z|} + (9z — 80 1n z£) |, 


= 9 + 81 ln9 -— 801n 10. 


For n Æ 9, we will run the calculations piecemeal: 


10 
1 

[Area below xy = 10n] = U (n) = 9n +/ mule 
n T 


10 
= 9n + 10n In z|?’ = 9n + 10n ln —. 
n 


[Area below xy = n] = L(n) = l "de 
1 T 


= nina|, =nlnn. 
Then 
P” = [U(n4+ 1) —U(n)] + [L(n + 1) - L(n)] 
= 9nInn — (n+ 1) In(n+ 1) + 9+4+101n10. 


Solution 2 
Amit Itagi 


If the d-digit number is written in scientific notation, the significand is uniformly 
distributed between 1 and 10 (as d + ov). For illustrative purposes, 2936 is 2.936 x 10° 
in scientific notation; 2.936 is the significand. 

The mantissa of the logjọ of the number or the log,, of the significand has a 
probability density function of f(x) = C-10® over x € [0, 1), where C is a normalization 


In(10) 


1 
constant —;———— that turns out to be . If mı and mg are the mantissas for 


o L0*dx 

the two numbers being multiplied, the mantissa of the product is M = (mı + m2) — 
[mi +me2]|. If M € [logio D,log;)(D + 1)), then the first digit of the product is D. 

Figure shows the joint space of mı and mg. The partitions of the shaded 
regions are at mı + mz values of logo 2 — logy) 10 and (1 + log), 2) — (1 + logy, 19). 
The partitioned regions come in pairs where regions with the same color have the same 
value of M and thus the same first digit D. The red region has first digit 1, the blue 
region has the first digit 2 and so on until the last green region has first digit 9. Thus, 
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the probability of having digit D as the first digit is 


a (D+1) o “A D o 
m2=0 m,=0 m2=0 m,=0 
1 1 
m2=log;y D J/mi=1+log,;y D- M2 
1 1 
= / Í } Fem) fona)dmidm 
m2=log,9(D+1) Ym1=1+4+log;g(D+1)—me 


o1 
~ 81 


P(D) = { 


(D +1) n(D +1) — Din(D) — 1] 


+ ~ [D\n(D) — (D+ 1) In(D + 1) + 14+ In(10)] 


ol 
~ 9 


10 1n(10 
ey 


[DIn(D) — (D+ 1)In(D+1)+4+1] 31 


Figure 5.17: Probability of first digit in product, logarithmic scale. 


The computed values are shown in the table: 


Leading Digit | Probability 
1 0.241348 
0.183209 
0.145454 
0.117380 
0.095007 
0.076402 
0.060474 
0.046549 
0.034178 


O NANAON 
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Solution 3 
N.N. Taleb 


x,y © UniformDistribution[{Z,H}] and it is independent. The product has for its 
probability density function y(.): 


H? 
log (=) 
a H?>zAHL<z 
p(z) = 
z 
log (=) 


We have 


H? 
| p(z) = 1 (verification for paranoid people) 
L2 


10L? 2 
L*(1+ 9log(9) — 10log(10 
Af eles 2 ee = ieee) 
9L? (H - L) 
H? 
P, = (z)dz 
9/10(H+1)2 


3 
H? — 18H — 9(H + 1)? log(10) + 18(H + 1)? log E + 3) -9 


10(H — L)? 
We replace L = 10%t, H = 104-1 


Pa = (Pi + Pale 


3 
—9d+35d+2 4 1004 (180 log (Geo + 3) +9 —9log(9) — 80 tog(10)) + 100 


(10 — 9 102)” 


1 log(1 
lim Py = 9 + log(9) — BU ag (10 


doco 8 


= 0.0341776. 


Illustrations 
Marcos Carreira 


I looked at z being the fractional part of log,)(x) + logig(y), then the frequency 
distribution of |107|. The results were very close to the exact count for d = 2 


(Figures 5.21). 
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Figure 5.18: Sum of two logarithms 1. 


Figure 5.20: Sum of two logarithms 3. 
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Figure 5.21: Sum of two logarithms: contour plot. 


Birds on a Wire 


Solution 1 
Nathan Bowler 


The following argument, although not rigorous, gives the right answer and shows why 
it is true (and I guess with a bit of effort you could make it rigorous). Consider a 
wire with lots of birds on it and scale it up so that there is one bird per unit length. 
Consider a bird at b. I shall assume that b is far enough away from the ends that 
they make no difference. For any interval, the number of birds in that interval has 
about a Poisson distribution with parameter x. In particular, the interval is empty 
with probability 1 — e~*, and the distance to the nearest bird to the right has an 
exponential distribution with parameter 1. The same may be said for the nearest bird 
to the left. So the distance to the nearest bird overall has an exponential distribution 
with parameter 1 + 1 = 2. Without loss of generality [WLOG], the nearest bird at b’ 
satisfies b’ — b = x > 0. Then b is the nearest bird to b’ with probability e~”. 

Give bird b a paintbrush, and let it paint half of the interval (b,b + x) if it is the 
nearest bird to b’ and the whole interval otherwise. A simple combinatorial argument 
shows that if each bird on the wire does this, then the wire will be painted as in the 
riddle. So we are interested in the expected length painted by bird b since this is (after 
scaling) the required value. However, the comments in the first paragraph allow us to 
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evaluate it as a simple integral: 


This value is 0.38888888..., which seems to fit with the results produced by the 
applet. I must admit, at first I thought the answer was going to be something nice like 


e71, but a is what the algebra gives. 


Solution 2 
Mark Huber 


Here is another way of thinking about it: suppose that for a bird at position a on the 
wire we assign a random variable La that is the length of the yellow line to the left of 
a. So if a is closest to its bird on the right and if the bird to the left of a is closer to 
its left bird, La = 0; otherwise it is some positive number. 

What is the probability that La is in some tiny little interval around h? Or, in 
probability notation, that P(L, € dh) =? 

There are two ways La can be close to h. One case is when there is a bird at a—h, 
no bird in the interval (a —h,a) and no bird in (a,a+h). This occurs with probability 
(1 — 2h)"-2ndh. 

Another case is when there is a bird at a — h, no bird from (a — 2h, a) and at least 
one bird in (a,a +h). This occurs with probability ((1 — 2h)"~? — (1 — 3h)"~*)ndh. 

To find the expected value of La, this probability has to be multiplied by h, then 


1 7 1 
integrated for h from 0 to z This gives an expected value of about in n Since there 
n 


7 
are n birds, the total yellow line is expected to be I5 
A variant of the strong law of large numbers then completes the proof, showing 
that as the number of birds goes to infinity, the amount of line colored yellow is Ig 
Solution 3 
Moshe Eliner 


Let us pass through all birds, starting, say from the left, stopping at every fourth bird. 
I will call this segment 4b. Notice a delicate point: 

Denote the shortest connecting segments sc (i.e., yellow lines). Each four birds has 
three inner connecting segments that average 2. 
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1. The states of sc = 1 and sc = 3 force inner point positions on the neighboring 
(fourth) bird. 


2. On a uniform wire, birds tend to have uniform distribution, hence inner point 
positions cannot be predefined. 


3. There cannot be sc = 1 or three more than average. This implies that each four 
birds have on average sc = 2. 


Now it is enough to calculate the density when sc = 2: 

There are three possible sc = 2, but, since one can rearrange segments inside the 
four-bird groups, all have the same density. Let us choose one of them, e.g., the one 
with the two shortest connecting segments. 

Integrate ydxdy: 


1 2 
Li y from y = 5 to z 7 from 2y- 1tol-y 


1 
2. y from 0 to zy? from 0 to y. 
pia 7 
This yields —— 
108 
7 


is — 


18° 


1 
, but, since the total integration area is p the resulting density 


Solution 4 
Stuart Anderson 


Looking at these solutions, I was particularly struck by solution 3, which did not 
involve the notion of an exponential distribution at all. I found it somewhat hard to 
follow, so I started looking at the assumptions behind the other solutions to see how 
you could solve the riddle without the exponential distribution. 

Here is the surprising result: part of the solution REALLY IS independent of the 
distribution, although the final answer does depend on the distribution. 

Let L be the length of an interval between adjacent birds. Suppose that the 
P(L < x) = F(x) for an arbitrary cumulative probability function F. Then the density 
function f(x) = F'(x) gives the probability density for the interval to have length z. 

A given interval will be painted twice if it is “small,” i.e., shorter than both its 
neighbors; once if “medium,” i.e., shorter than one and longer than the other neighbor 
and not painted if “big,” i.e., longer than both neighbors. What is the probability of 
each case and its expected length? 

For the interval that is longer than its neighbors, the probability is 

oo 1 1 
f EO FE: Sed | (Par = 5. 
0 0 


Similar calculations for the other two cases, using 1 — F(x) in place of F(x) when 
we want the neighboring interval to be longer instead of shorter, give the same result. 
This is rather surprising in itself: regardless of the distribution, the probabilities P(big), 


1 
P(medium) and P(small) are all exactly 3" 
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The expected length of an interval that is longer than both neighbors is 


f EFE Fe): fede 
1 9 
(3) 
which we integrate by parts (very carefully!): let u = x, and dv = (F(x))?- f(x)dx = 
d((F(x))* — 1) 


3 
and this one allows the u - v term to vanish at oo.) Then we get 


. (We can choose any antiderivative we like when integrating by parts, 


3 oo saree = = — F3\dr 
(x (F -d - f E- na= f (1 — F°)d 


if F(x) approaches 1 sufficiently quickly for the leftmost term to vanish and the integral 
to converge. Let us call this number B (for “big” interval). 
A similar but simpler calculation shows that the expected length of an arbitrary 


co 
interval (no conditions as to its size relative to its neighbors) gives i (1— F)dz. Let 
0 


us call this number A (for “average” interval). 
Since all but the big intervals are painted, the expected fraction painted is 


= =1- When F(x) = 1 — e~** is the exponential distribution, 
3 1 
B= f (ec ke Ben RP ee SBP) lap = “273 = A 
while A = pee = z Therefore, the expected fraction painted is 
= 11 7 
i 3 a ag 8 
k 


2 
A more detailed calculation shows that I8 of the length is painted twice (small 


11 
intervals), ig is painted once (medium intervals) and I8 is unpainted (big intervals). 
Since these types of intervals occur with equal probability, this also shows that the 


ratio of mean sizes of small, medium and big intervals is 2 : 5 : 11. Of course, if 


1 
we took 3 of the intervals completely at random, we would expect them to cover 


1 — 

3 18 
S:M:B:A=+2:5:11:6. Therefore, medium intervals are slightly shorter than 
the average length. 


of the wire, so we can extend the ratio to include the overall average length: 
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Solution 5 

Bogdan Lataianu 

Let Uq) be the order statistics of uniform distribution, i = 1...n—1 and vr = uj—Uj_1 
with uo = 0. Then the probability density function of v1,..., Up for )>;_, vi < 1, as 
shown in {30}, is 


(n —1)! 
n—r-—1)! 


(=v... u) Tt r =1...n-— 1. 


fwi... Ur) = ( 


When r = 3, f(v1, v2, 03) = (n—1)(n—2)(n—3)(1—1 — v2 — v3)”74. Let Z; be the 
painted length of the it? interval after a realization of the order statistics and denote 
X =Vji_1, Y = V;, Z = Visi, A the event V; < Vj_1 or V; < Vi+ı. Then Z; = 14Y and 


E(Z;) = J Lavf(o,y,2)dedyde = i ET 


We have 
P(A)= P(Y < X)+ P(Y < Z)—- P(Y < X and Y < Z). 


Therefore, 
1/2 l1—y l1—r—y 
E(Z:) = (n—1)(n—2)(n —3) ( Í I n y(l—@—y—2)"“dededy 
0 y 0 


1/2 pl-—2y pl—av—y 
+ | f y(l— z — y — z)” tdzdzdy 
0 0 y 


1/3 pl-—2y pl—av—y 
— I Í J y(1— z — y — z)” tdzdzdy 
0 y Y 


DE gt 1O 7 
= 4n 4n Qf 18n 


In our case, suppose we have n birds. Then E(painted length) = E (two intervals 
7 
8n 


never painted) + E(two intervals always painted) + (n—3)E(Z;) = +(n—3) i 


n+l 
ith the limit i 
wi imi I5 


Lucky Times at a Moscow Math Olympiad 


Solution to Question 1 


For a moment defined as h : m : s, consider the moment 11 — h : 59 — m : 60 — s. The 
two constitute reflections of each other across the vertical diameter (the line 12-6), see 
Figure 

For example, 7 : 42 : 12 reflects into 4 : 17 : 48. The latter is lucky; the former 
is not. 


104 Cut the Knot: Probability Riddles 


Figure 5.22: Lucky times at a Moscow math olympiad 1. 


In general, a legitimate position of the three hands reflects into another legitimate 
position. However, one is lucky; the other is not. It follows that there is a 1-1 correspon- 
dence between lucky and unlucky moments. More than that, two discrete successive 
lucky moments, like say h : m : s and h : m : s + 1, correspond to two successive 
unlucky moments 11— h : 59— m : 60—s— 1 and 11—h: 59—m: 60—s. Thus, the two 
define one-second intervals corresponding to each other, though one is entirely lucky 
and the other is entirely unlucky. The argument could be expanded to intervals longer 
than one second. In general, the reflection of a lucky interval is an unlucky interval of 


1 
the same length, making the two events equiprobable: the probability of each is z 


Solution 1 to Question 2 
A. Bogomolny 


Consider any two hands, e.g., the hour and the minute hands, and the clock face sector 
formed by their extensions across the center of the face as in Figure 


Figure 5.23: Lucky times at a Moscow math olympiad 2. 


Whenever the second hand falls into that sector, the moment is unlucky; a moment 
is lucky if the second hand falls outside the sector. 

We can simulate the situation with two random points on the circle. Two points 
split the circle into two arcs and it appears that we are interested in their relative 
lengths. The unlucky sector is never bigger than 180°. 

Cut the circle at one of the points. The result is a segment with a random point 
inside. The point divides the segment into two pieces. In the solution of Riddle on 
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3 
page we found that the average length of the longer piece is ri of the whole segment 


1 
while that of the shorter piece is 3" 
Back to the original riddle, the lucky moments of a day come up with the probability 
3 
of T three times more often than the unlucky moments. 


Note that an objection may be raised as to the reasoning behind dropping of two 
points. If, as above we associate the two points with the positions of the hour and 
minute hands, we should admit that not all positions correspond to legitimate time 
moments. However, for any such position, the clock face can be rotated to make the 
hands show a real time. This leads to the idea of reducing the riddle to that on a 
segment. 


Solution 2 to Question 2 
Thamizh Kudimagan 


For angle 0 between the hour and minute hands, the moment is unlucky only if the 
second hand is in the diametrically opposite sector of angle 0, so it is lucky over 27 — 0 
out of 27, and 


_ ["2n-Od(0) 3 


Solution 3 to Question 2 
Thamizh Kudimagan 


In fact, the integration does not even seem necessary, based on the argument of 
Solution 2. 


e When hour and minute hands are closest, P(lucky) = 1. 


N| = 


e When hour and minute hands are farthest, P(lucky) = 
. T . 1 1 3 
So, P(lucky) for all hour and minute hands positions is J (1 + 5) =. 


Probability of a Random Inequality 


Solution 1 


Substituting y = 2a — x into the inequality gives 
b’a(2a — x) > (b? — 1)a?, or, 
b’ x? — 2ab*a + (b? — 1)a? < 0, or, 


(ba — (b — 1)a)(bx — (b + 1)a) < 0, or, 


(b— 1)a (b+ 1)a 
5 LTA 5 : 
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b+1 b-1 
The length of the interval where the given inequality holds is ere. Ja = 


b b 
2 
= The sought probability is the ratio of that length to the length of the interval of 
bigest bags : 2a 1 
the distribution of x, i.e., 2a. Thus, we have > +2a= 5 
Solution 2 
N.N. Taleb 


We have the transformation x ~ U[0,2a]. The equation z = ry = (2a — x)x has 
probability density function: 


1 
2) =. — = —— 5 
p(z) 2ava? — z 


It also has the survival function: 


0< z <a. 


2 
r 1 a2? — k 
P(Z>k) = f dz = ———_.. 
( ) k 2ava?-z a 


a? (o? — 1) 


b2 


1 
Replacing k with , we get a 


Points on a Square Grid 


Solution 1 
Given two grid points, their x coordinates have the same parity with the probability 


1 
of z It is the same for the y coordinates. Both coordinates have (pairwise) the same 


1 1 1 

parity with the probability of sa y 
For n = 3, for any combination of the coordinates of one point, there are six 
combinations of parities for the other two that do not lead to a midpoint at a grid 
node. For example, if one point has both coordinates even, denoted e/e, then for the 


other two there are the following possibilities: 


e/o, ofe,o/o 
o/e, o/o,o/e 
o/o, ofe,e/o. 
Since there are four combinations of parities for the chosen point, there are 4-6 = 24 


possible combinations that do not place a midpoint at a grid node. For three points, 
there are 4° possible combinations so that the probability of getting a midpoint on a 
: 2 5 
grid node equals 1 — 64 8 
For n = 4, there is no midpoint on a grid node if and only if the four points have all 
four distinct parity combinations: o/o, o/e, e/o, e/e. There are 4! ways to distribute 
these combinations between the four points, out of the total of 44 = 256 combinations. 


24 
This happens with the probability 356 ~ Es Thus, at least one of the midpoints falls 


32 
; i a 3 29 
on a grid node with the probability of 1 — 39 39. 
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For n = 5, it is shown elsewhere with the pigeonhole principle that, in this case, at 
least one of the midpoints falls onto a grid node. On the other hand, since there are 
only four possible parity combinations for a 2D grid point, if there are five points, the 
probability is 1 that two of them have the same combination of parities. This may be 
considered a probabilistic solution to an otherwise deterministic problem. 


Solution 2 
dash_amitabh on Twitter 


Each point has four equally likely parity combinations (odd/even on both coordinates). 
No midpoint will be a grid node when no two points have the same parity, for which 
the probability is 


4 
n=4, 1 Pe ae oe ea 
4 32 


1\> 
n=5, (3) -4.3-2-1-0=0 (unauthorized addition). 
Si ey : RETE 3 1 
The probability that a midpoint will fall at the node of the grid is 1 — aR 
3 5 3 5 
1 ggi! 33 apt tha), for n = 2,3,4,5, respectively. 
Solution 3 
Amit Itagi 


The nodes can be considered to have integral x and y coordinates. For midpoints to 
have integral coordinates (with probability denoted by P), the coordinates of both 
points should have same parity—even or odd (a requirement for both x and y coordi- 
nates). For each point, there are four possibilities (four bins) with equal probability: 
(e,e), (e,0), (0,e) and (0, 0). 

For n > 4, the pigeonhole principle guarantees that there will be at least two points 
in the same bin and P(n : n > 4) = 1. 

For the remaining cases, P is one minus the probability of assigning distinct bins 
(assign the first point to a random bin, the second point to one of the remaining three 
bins, the third point to one of the remaining two bins and so on). Thus, 


TT (4-8) 
P(n) = g 7 ag 


1 (n > 4). 
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Hence, 

3 1 
P(2)=1--=- 
(2)=1-5=3, 

3 2 5 
P(3)=1-~-- == 
(8)=1-5-2=5, 

32 1 29 
PASIE aa a 
P(5)=1 


A Triangle out of Three Broken Sticks 


Solution 1 to Question 1 

A. Bogomolny 

Assume the sticks are of length 1 and consider the cube with vertices A(0,0,0), 
B(1,0,0), D(0,1,0), E(0,0,1), C(1,1,0), F(1,0,1), H(0,1,1) and G(1,1,1) as shown 
in Figure 


Figure 5.24: A triangle out of three broken sticks. 


The left pieces of the sticks are defined by their right points and I shall describe 
their lengths with the real x,y, z € (0,1). Thus, the cube constitutes the sample space 
for all possible combinations of the breaking points. For x,y,z to form a triangle, we 
need three inequalities: 


Z<uUt+y 
yYy<Ir+z 
Z<yrzZ. 


Note that, say, line AF has the equation z = x and line AH has the equation z = y 
so that the plane through A,F,H is described by z = x + y. The plane divides the 
space into two half-spaces. In the one that contains G, x+y > z. 

Similarly, the plane AFC is described by x = y +z and the plane ACH by y = z +z. 
The set of points that satisfies the three inequalities belongs to the intersection of the 


Chapter 5: Solutions 109 


three half-spaces and the cube: that is, the figure V = ACFHG. It consists of two 
pyramids with the base CFH and apices at A and G. 
V is obtained from the cube by cutting corner pyramids ACFB, ACHD and AFHE. 


Each of these has volume : . : ‘l= ` It follows that the volume of V is 1—3. : = 7 


Solution 2 to Question 1 
Amit Itagi 
Without loss of generality, let the length of the sticks be unity. Let the length of 
the three pieces (sorted) be x > y > z. Let the regions in the X,Y,Z space (and 
the projections of the space) where the sorting holds and where forming a triangle is 
possible be termed “feasible region” and “positive region,” respectively. 

Figure shows the feasible region in the YZ plane in white with the condi- 
tions on x. The conditions in black and blue are for the positive and feasible regions, 
respectively. 


Figure 5.25: Left pieces. 


The probability of triangulation is given by 


1 


1/2 pl—z py+z 1 y 
Ja a J, dxdydz oh. dxdzdy 
z 1/2 J z=1—y J r=y 
1/2 pl—-z 1 y 1 
E J [. dadydz +f dadzdy 
g= 1 1/2 J z=1—y J r=y 


nth 
= È 1 
8 24 
1 
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Solution 1 to Question 2 
A. Bogomolny 


1 
If now x,y,z stand for the lengths of the longest pieces, then 3 < x,y,z < 1. Then, 


1 1 
for example, x + y > 3 + z= 1 > z, and the other two inequalities are similar. We 


conclude that the probability in this case is 1. 


Solution 2 to Question 2 
Amit Itagi 


Figure [5.26] corresponds to this case. Clearly, the probability is 1. 


0 0.2 0.4 0.6 0.8 1 


Figure 5.26: Longer pieces. 


Solution 1 to Question 3 
A. Bogomolny 


1 
If now x,y,z stand for the lengths of the shortest pieces, then 0 < x,y,z < z and 


1 
each of the three is drawn uniformly randomly from the interval (o 5) . The situation 


is similar to question 1 but in a smaller cube. However, since all that matters is the 
relative lengths of the sides, question 1 applies directly. The probability in this case 
1 


is =. 


2 


Solution 2 to Question 3 
Amit Itagi 


This case is the same as the case of “left pieces” except that x, y and z are drawn 
from (0, 0.5] instead of (0, 1.0). The probability does not change by this scaling and is, 


1 
therefore, 5 
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Solution 1 to Question 4 
A. Bogomolny 


First, I should mention quite a few people observed that it does not matter whether 
we choose left or right pieces in question 1 because both side lengths of a broken 
stick have the same probability distribution. This implies that when a random choice 
is made between the left and right pieces for each stick, the probability of getting a 
1 

triangle is no different than in question 1, i.e., z 
It occurred to me that there might be a difference with the case where the choice is 
made between the long and the short pieces, rather than between the left and the right 
ones. Many people insisted there should not be any difference. I made the mistake of 


1 
thinking that there should be no difference since min(x, 1 — x) is distributed on o, z 


1 
while max(, 1 — x) is distributed on E 1! . Tangentially, support also came from the 


difference in probabilities in questions 2 and 3. That was explained and corrected by 
Timon Klugge. 


1 
Where the three numbers z, y, z satisfy 0 < x,y,z < z the broken sticks form a 


triangle. Let A(x, y,z) = 1 if x,y,z form a triangle and 0 otherwise. We could denote 


the probability we found as 
1 1 
P(Al.n2)=110S0n255) = 


Now, if A(x,y,z) = 1 then also A(2z,2y,2z) = 1, such that we were able to 
conclude that in the domain of definition, [0,1], A(x, y, z) = 1 half the time. 
Now, there is a harder road to arrive at the same conclusion. Assuming 0 < 


1 
z,Y, Zz S 7 what is the probability of P(A (x,y,z) = 1)? In other words, what is 


1 
P (^t, y,z)= l and 0 < x,y,z < 5) ? The formula for conditional probability gives 


1 
P (dena) = ] and 0 < z,y,z < z) 


1 1 
=P (Aey) =D eys) Phosas) 


1 
16` 


Similarly, with the result of question 2, we conclude that 


oe 
a 


1 
P(An2) = 1 and 5 LEZ 1) 


1 1 
=P (At.u2=D15 senesi) Pò} senes) 
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Now we have to consider six additional cases that come in two groups of three. If 
S and L stand for short and long, respectively, and we define Sy = min(x,1— x) and 
Ls = max(z, 1 — x), and similarly for y and z, then there are eight combinations SSS, 
SSL,..., LLL. The first, SSS, and the last, LLL, have been just treated. 

We need to find P(A(a,y,z) =1| SSL) and then also P(A(z,y, z) = 1| SLL). 


SSL 
We have three variables x,y,z that satisfy 
Z 1 z 1 5 1 
Bz, YSP >T 


In addition, the variables need to satisfy x + y > z. Points (x,y,z) that satisfy the 
four conditions are located inside the corner pyramid in a small cube as shown in 


Figure 


X 


Figure 5.27: Splitting probabilities for two short and one long pieces. 


1 1 1 
The volume of this pyramid, relative to the larger cube, is 3°68 Cycling 


48 
: : 1 1 
through the three variables gives 3 - B E 
SLL 


In this case we need to satisfy the following inequalities: 


1 1 1 
Sa >y Ra rt+y>z, t+zZz>yŅ. 
The points (x,y,z) that satisfy all five inequalities lie outside two pyramids in the 
small cube shown in Figure 
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= 
=- 


X 


Figure 5.28: Splitting probabilities for one short and two long pieces. 


1 1 1 
The relative volume of that space is 3 (1 2° 5) = iz Cycling through 
; ‘ 1 1 : . 
the three variables gives 3 - 37r Putting everything together, denote S = 


{SSS,SSL,...,55'S}, then, the formula of total probability is 


P(A(z, y, 2) = 1) = XC P(A)z,y, 2) =1|w) 


wES 
1 1 1 1 
=u e ae 
o1 
=z 


As Long Huynh Huu has observed, that was clear to start with. 


Solution 2 to Question 4 
Amit Itagi 


For the case of “left pieces,” the length of the stick has a uniform distribution of (0, 1). 
Choosing one side or the other does not change the distribution for the chosen stick as 


1 
both sides are identically distributed. Thus, the probability for this case remains 7 


Solution 1 to Question 5 
A. Bogomolny 
1 


1 
The expectation that x,y, z € (0, 1) form side lengths of a triangle is Ey y,z = 0- go z 


The same is true if we replace x with 1-— x, y with 1— y or z with 1 — z so that, for 


example, Ey_2,y,2 = =. There are eight such combinations. The eight events are not 


2 
independent; however, the individual expectations can be summed up, which gives the 


1 
total expectation of 8 - 3“ 4. 
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Solution 2 to Question 5 
Amit Itagi 


There are eight combinations depending on which side is chosen for each stick. For each 
combination, the chosen side of a stick can be arbitrarily labeled “left.” This approach 
maps every combination to the “left pieces” case, so the probability of forming a triangle 


1 
is —. Thus, the expected number of triangles is four. 


Probability in Dart Throwing 


Solution 1 
The region where points are closer to the center of the square than to an edge is the 
intersection of the interiors of four parabolas, as shown in Figure 


Figure 5.29: Dart throwing: the whole picture. 


The probability sought is the ratio of the area of that region to the area of the 
square whose side was chosen to be 2. 
The region can be seen to consist of eight congruent pieces, as in Figure [5.30] 


Figure 5.30: Dart throwing: the whole picture. Showing symmetry. 
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Thus, suffice it to calculate the area of one octant (Figure |5.31). Note that the 
square also consists of eight parts, each housing one part of the region, so we need only 
compare the two areas of the paraboloa and the triangle. 


Figure 5.31: Dart throwing: one octant. 


2 
Y Tts 


1 
The parabola is defined by the equation 1 — x = \/x?+y?, or x = 


intersection with the diagonal y = z is found from z? +2gz—1 = 0 to be (/2- 1, vV2— 1). 
It is easier to integrate by y than by x. The area of the shaded region within the 
triangle is 


v2-1 1-4? PoS y yy 
nd ne rs 
0 0 


4/2 -5 


1 1 
The area of the triangle is z 1-1 = 5 and the ratio of the two is 3 ~ 0.21895. 


Solution 2 

Amit Itagi 

Using symmetry, the riddle can be analyzed in the first octant. Let the side of the 
square be 2. The curve that divides the region into points closer to either vertex of the 
vertical edge is the parabola with focus at the vertex and directrix as the vertical edge 


(Figure (5.32). 
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P__\-r(9)cos¢ 


Figure 5.32: Dart throwing in polar coordinates. 


Let a point P in the octant have polar coordinates (r,¢). Its distance from the 
vertex is r and that from the edge is r(1— cos ¢). Thus, the equation of the parabola is 


1 


1) = 1+cos¢ 


Hence, the required probability is 


pa 22a on the vertex side = the parabola g T 12 Gu 
area of the triangle o 2 
-{" do _ sin ¢(2 + cos ¢) TA (4/2 — 5) 0.219 
= Jo «© (1 +cos¢)?— 33(1 +. cos)? jo 3 Se 
Notes 
From the last formula, 
1 1 1 1 1 
M co 2 
ris (1 + cos (=)) oe ie 
n 


Interestingly, this limit could be found with additional calculations. As n grows, 
n-gon approaches a circle and, due to symmetry, the same happens with the intersection 
of the parabolas. The radius of the former is twice the radius of the latter and their 
areas are in the ratio 4: 1 (Figure [5.33). 


0.254 


0.244 


0.234 


9 

N 

N 
1 


Probability 


T T T 
3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 
Vertices 
Figure 5.33: Dart throwing in polar coordinates, n-gon board. 
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Solution 3 
Derek Boeckner 


We generalize the question to n-gons. The region of points that are closer to the center 
than to the boundary of the n-gon is the intersection of the interiors of n parabolas 
with a 2n-fold symmetry. Thus, we need only to focus on one of 2n wedges. 

Let the center of a regular n-gon be at the origin. Take an edge as the line x = 1. 


Mixing in polar coordinates, cut the line off at 0 = 0 and at 0 = Z The boundary of 
n 


the bullseye/target where the center is nearer than the edge is given by a parabolic 
1 


arc with focus at 0 and directrix at x = 1. The equation for this is r = ————.. 
1 + cos(6) 


We only look at the right triangular wedge between 0 = 0 and 0 = Z and then exploit 
n 


symmetry. 
We want the probability of being closer to the origin than this boundary, so we use 
polar integration: 


1 [m 1 a 1 (7 40 
al (cs) a=5 f = a 
1/4 3 ale ae T 
~ 16 (Fem Ca stan (Z)) 


Since the area of the triangle is we have a probability of 


tan (=) 
2 

i x 2 + cos (=) ; (1 + 2cos (=) + cos? (=) 1) 

“Ol f 


3 = 


eO ETO) 
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Notes 
These notes follow the tweets of N.N. Taleb (more authentically: “This is the curse of 
dimensionality behind P 4 NP we used with GMO idiots”); the calculations and the 
graph in Figure [5.34]are by Marcos Carreira. 

An analogous problem formulated in higher dimensions leads to a surprising result: 
the probability of getting nearer to center than to the edge tends to zero. 


0.00 E f 1 L L f 7 7 q 
2 4 6 8 10 


Figure 5.34: Dart throwing in polar coordinates. The bigger the dimension, the less the chance 
of hitting the center. 


Probability in Triangle 


Solution 1 
Leo Giugiuc 


Let x, y and z be the areas of triangles PBC, PCA and PAB. We have 


(y+2z)A+yB+2zC 


K= , 
2(y +z) 

tAt+(z+a2)B4+2C 

L= ; 
2(z + x) 

M= tA+yB+(%+y)C 
2(x +y) 


P is inside A KLM if and only if P = aK + BL + 7M, with a,8,y > 0 and 
a+ß+y=1,ie., 


+8 x x x 
2(z +2) a Ae+y) wex+yt+z 

a d +8 ey d = a 
2(y+z) 2 2(x+y) xr+y+z 

z ee Y z 
“ 2(y + z) re +r) 2 r+y+z 


Solving that system shows that the signs of a, 6 and y are determined by the signs 
of (xz+y—z)(x—y+z), (-w+y+z)(a+y—2z) and (x—y+z)(—x+y+z), respectively. 
Introduce X = —-a#+y+2,Y =x-yt+zand Z=a+y-Z. 
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Next, we show that X,Y, Z > 0. Indeed, assuming to the contrary that, e.g., Z < 0 
we get Y > 0 and X > 0, from which not all a, 8 and y are positive—a contradiction. 
We have 


1 


= 1 yB+C | yCta | gAtB l 
r+y+z 2 2 2 


This exactly means that P is interior to A KLM if and only if Z > 0, Y > 0 
and X > 0. 

In other words, P is interior to A KLM if and only if P is interior to the medial tri- 
angle of A ABC, so the probability of P being in A KLM is the same as the probability 


1 
of its being in the medial triangle, i.e., i 


Solution 2 
Francisco Javier Garcia Capitán 


If P = (x : y : z) in homogeneous barycentric coordinates, then A’ = (0: y : z), and 
K = (y+ z : y : z). From the identity 


(utana = tyta EE 4 ( x+y-+ z){1,0, 0}, 


we have the ratio 


PK —xr+y+z 


AK r+y+z 


The equation —x + y + z = 0 corresponds to the line parallel to BC through the 
midpoints of AB and AC. The ratio PK : AK is positive if and only if P lies in the 
half plane determined by this parallel not containing A. 

Therefore, the point P is interior to A KLM when P is interior to the medial triangle 


1 
of A ABC and we get that the sought probability is T 


Chapter 6 


Combinatorics 


Probability is expectation founded upon partial knowledge. A perfect 
acquaintance with all the circumstances affecting the occurrence of an 
event would change expectation into certainty, and leave neither room nor 
demand for a theory of probabilities. 


George Boole, 1854, An Investigation of the Law of Thought 


Riddles 


6.1 Shuffling Probability 
Problem 91] 


Integers 1 through 53 are written on cards, one per card. The stack is thoroughly 
shuffled. Five cards are drawn. What is the probability that the cards are drawn in 
their natural order, the smallest first and the rest in increasing order of magnitude? 


6.2 Probability with Factorials 
Problem 10] 


Let R be the set of rational fractions from (0,1) in lowest terms so that only if 
0 <n < dand gcd(n, d) = 1. For K € N, define 
A(k) ={x:ce R,nd= K!} 


and set A = A(20) U A(21) U A(22). 
What is the probability that a random number drawn from A belongs to A(20)? 
Answer the same question for 


A= U A(K). 


K=20 


6.3 Two Varsity Divisions 
103 


20 varsity teams are assigned randomly to two divisions. What is the probability that 
the two strongest teams will happen to be 


e in different divisions? 


e in the same division? 
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6.4 Probability of Having 5 in the Numerator 
Problem 10] 


Let set S consist of all rational fractions in lowest terms from (0,1) such that if ; es 


then ab = 10! 
What is the probability that the numerator of a random member of S is divisible 
by 5? 


6.5 Random Arithmetic Progressions 
pp. 37-40] 
Five numbers are drawn at random without replacement from the set {1,..., n}. Show 


that the probability that the first three numbers drawn, as well as all five numbers, 
can be arranged to form arithmetic progressions is not less than 


1 
n—-1\" 
3 
6.6 Red Faces of a Cube 
p. 78] 
The outside of an n x n x n cube is painted red. The cube then is chopped into n° 
unit cubes. The latter are thoroughly mixed up and put into a bag. One small cube is 


withdrawn at random from the bag and tossed across a table. What is the probability 
that the cube stops with the red face on top? 


6.7 Red and Green Balls in Red and Green Boxes 
C.W. Trigg, Problem E1400, p. 698], 


There are six red balls and eight green balls in a bag. Five balls are drawn randomly 
and placed in a red box; the remaining nine balls are placed in a green box. What is 
the probability that the number of red balls in the green box plus the number of green 
balls in the red box is not prime? 


6.8 Probability of Equal Areas on a Chessboard 


A random rectangle is drawn on an 8 x 8 chessboard with the sides of the former 
parallel to the sides of the latter. What is the probability that the rectangle contains 
the same amount of both colors? 

Consider two cases (where measurements are relative to a 1 x 1 square side length): 


e The side lengths of the rectangle are real numbers. 
e The side lengths of the rectangle are integers. 


This problem arose in the discussion following Stan Wagon’s article on the prob- 
lem of integer rectangles [95]. The follow-up article, by Nathan Bowler, is available 
online [22]. 
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6.9 Random Sum 
Ten coins are labeled with numbers 1 through 10 on one side. All 10 coins are tossed 


and the sum of the numbers landing face up is calculated. What is the probability that 
this sum is at least 45? 


6.10 Probability of Equilateral Triangle 


In a circle of radius R there are six chords, equal in pairs: {R, R,a,a,b,b}. The chords 
may form inscribed hexagons in a variety of ways. 


What is the probability that some three endpoints form an equilateral triangle? 


6.11 Shelving an Encyclopedia 


An encyclopedia with 26 volumes (A, B,C,...,Z) is put randomly on a shelf. 


1. What is the probability that volumes A and B appear next to one another in 
the correct order AB? 


2. What is the probability that volumes A, B and C appear next to one another in 
the correct order ABC? 


3. What is the probability that any two successive volumes appear next to one 
another in the correct order? 


6.12 Loaded Dice | 
Problem 10] 


Prove that no matter which two loaded dice we have, it cannot happen that each of 
the sums 2,3,...,12 comes up with the same probability. 


6.13 Loaded Dice II 


It is impossible to load two dice so as to have all 11 outcomes equiprobable. Is it 
possible for 10 selected outcomes? 

This is a continuation of Riddle It appeared in the chapter “Mathematical 
Creativity in Problem Solving and Problem Proposing II’ by Murray S. Klamkin in 5l. 
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6.14 Dropping Numbers into a 3 x 3 Square 
Problem CC246| 


Place the numbers 1,2,...,9 at random so that they fill a 3 x 3 grid. 


1. What is the probability that each of the row sums and each of the column sums 
is odd? 


2. What if both diagonal sums are also required to be odd? 


3. What if exactly one of the diagonal sums is also required to be odd? 


6.15 Probability of Matching Socks 


In a drawer, there are r red, b blue and g green socks. Two are drawn at random. What 
is the probability of getting a matching pair? 


6.16 Numbered Balls Out of a Box 
Problem 320] 


A box contains 20 balls numbered 1,2,...,20. If three balls are randomly taken from 
the box without replacement, what is the probability that one of them is the average 
of the other two? Answer the same question for 21 balls. Which probability is larger? 


6.17 Planting Trees in a Row 
Section 10] 


A gardener plants three maple trees, four oak trees and five birch trees in a row. He 
plants them in random order with each arrangement being equally likely. What is the 
probability that no two birch trees are next to each other? 

Consider two cases: 


1. If trees of the same type are indistinguishable? 


2. If all trees are different? 


6.18 Six Numbers, Two Inequalities 
Donald Knuth’s puzzles] 


Six distinct digits, say 1, 2, 3, 4, 5, 6, are placed randomly into the boxes below. 


1. What is the probability that the indicated inequalities hold between the digits 
adjacent to the inequality symbols? 


La 3 | | 


2. Answer the same question concerning another placement of the inequality signs: 
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6.19 Six Numbers, Three Inequalities 
Donald Knuth’s puzzles] 


Six distinct digits, say 1, 2, 3, 4, 5, 6, are placed randomly into the boxes below. 


1. What is the probability that the indicated inequalities hold between the digits 
adjacent to the inequality symbols? 


6.20 Tying Knots in Brazil 


A is clutching three identical pieces of string in her fist, as illustrated below on the left. 


She asks $ to tie two ends of the string, chosen at random, at either side of her fist, as 
illustrated above on the right, so that there is one free end on either side. The result 
could be as shown below: 


1. What is the probability that all the pieces are joined in one long piece? 
2. With four strings? 


3. With five strings? 
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6.21 Tying Knots in Russia 
Problem 76] 


In some rural Russian communities, there existed the following form of divination: a 
girl would clutch in her fist six grass blades so that the ends of every one of them stuck 
out at both sides. A friend would then tie pairwise the top ends and then also the low 
ends. If, as a result, the grass blades formed a ring, it was a sign that the girl would 
be married within a year. 


1. What is the probability that the grass blades would be tied into a single ring? 


2. Answer the same question with 2n blades. 


6.22 Guessing Hat Numbers 
Problem 112] 


1. Five students each blindly choose a hat among six numbered 1,...,6. Each 
announces the largest and the smallest number he/she sees. 


What is the probability that each will be able to guess his/her own number? 


2. Answer the same question with n hats and k < n students. 


6.23 Probability of an Odd Number of Sixes 
Problem 470] 


1. What is the probability of an odd number of sixes turning up in a random toss 
of n fair dice? 


2. What is the probability of an even number of sixes turning up in a random toss 
of n fair dice? 


3. Are the two numbers equal? Why or why not? 


6.24 Probability of Average 
Problem 320] 
A box contains 20 balls numbered 1, 2,..., 20. If three balls are randomly taken out of 


the box without replacement, what is the probability that the number on one of the 
balls will be the average of the other two? 


6.25 Marking and Breaking Sticks 
pp. 36-41] 


A person makes two marks—randomly and independently—on a stick, after which the 
stick is broken into n pieces. What is the probability that the two marks are found on 
the same piece? 

Compare two cases: when the pieces are equal and when the division is random. 

As usual, when no distribution is specified, the word “random” refers to the uni- 
form distribution. “Independently” means independent of any previous action. This is 
especially important in the second part of the riddle. To avoid ambiguity, assume that, 
prior to breaking the stick, the n — 1 marks are made randomly and independently of 
all the marks already made. 


Chapter 6: Combinatorics 127 


6.26 Bubbling of Sorts 
pp. 21-22] 


A given sequence r1,T2,...,Tn of distinct real numbers can be put in ascending order 
by means of one or more “bubble passes.” A bubble pass through a given sequence 
consists of comparing the second term with the first term and exchanging them if and 
only if the second term is smaller, then comparing the third term with the second 
term and exchanging them if and only if the third term is smaller and so on in order, 
through comparing the last term rn with its current predecessor and exchanging them 
if and only if the last term is smaller. 

The following example shows how the sequence 1, 9, 8, 7 is transformed into the 
sequence 1, 8, 7, 9 by one bubble pass. The numbers compared at each step are under- 
lined. 


1 9 8 7 1 9 8 7 18 9 7 1 8 7 9 


Suppose that n = 40 and that the bubble sort algorithm applies to the random 
sequence of distinct numbers r1,72,...,749 in order to place them in ascending order. 


1. Find the probability that the number that begins as rọ will end up, after one 
bubble pass, in poisition #30. 


2. Find the probability that the number that begins as rg9 will end up in poisition 
#30 after two bubble passes. 


3. Find the probability that the number that begins as rg9 will end up in poisition 
#30 after n < 10 bubble passes. 


6.27 Probability of Successive Integers 
Problem 2] 


Five distinct numbers are chosen from 10, 11, 12,..., 99. What is the probability that 
at least two of them are consecutive? 
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Solutions 


Shuffling Probability 


You have five cards which may have been drawn in any of 5! = 120 possible ways. Only 


1 
one of these is the natural increasing order. The probability of this happening is —— 


120° 
The specific number of cards is a red herring [19]. No matter which number you 
start with, as long as you draw five cards, there are 5! = 120 variants. Each comes 


1 


with probability of 120° 


Probability with Factorials 


Let us list the prime factors of, say, 20!. All are less than 20; there are eight of them: 2, 
3, 5, 7, 11, 13, 17, 19. Since we are only concerned with the fractions in reduced form, 
these factors are somehow distributed between their numerators n and denominators 
d (in whatever power it takes), but no factor appears in both n and d. Therefore, there 
are 28 = 256 possible values for pairs n,d. For only half of them n < d, making the 
number of eligible fractions in A(20!) = 128. 

Since there are no primes between 19 and 23, A(21!) and A(22!) also have 128 
elements each. Since the pairwise intersections of the three sets are empty, a number 


1 
drawn from their union has equal probability to be drawn from either of them, i.e., z 
23!, 24! and 25! include an additional prime—23, implying that each contains 


1 
z2 = 256 elements. Thus, in this case, |A| = 3-128 + 3 - 256 = 9 - 128. Thus, 


1 
the probability of the drawn number to be from A(20) is J 


Two Varsity Divisions 
Solution 1 
P DEA . 1/20 . 
20 teams may be split into two divisions of 10 teams each in 3\10 ways. With two 


teams removed, 18 are left to be split between the two divisions. For the first question, 
we need nine teams to complete the division with one of the strongest teams (the other 
division will be filled automatically). Thus, the probability in question 1 is 


(3) 
9 10 
P= =—_, 
1/20\ 19 

2\10 


For the second question, we need eight teams to complete the division of the two 
strongest teams. Thus, the probability in question 2 is 


p- 4s) o 
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Solution 2 


Denote, for convenience, the two strongest teams A and B. When A is chosen into one 
of the divisions, there are 19 slots that B can fit in. For the first question, 10 of these 


slots are in the “other” division, giving the probability as ae 

For the second question, there are nine slots in the same division as A, making the 
probability of B falling there equal to 7 

Naturally, if A and B are not in the same division, then they are in different ones 


10 9 
so that jg + 79 =! 


Probability of Having 5 in the Numerator 


Solution 1 


There are four prime numbers below 10: 2, 3, 5, 7. Their factors may appear in either 


a 
numerator or denominator of 5 from S$ but not both. All in all, there are 2+ fractions 


: that satisfy ab = 10!. Half of them are less than one; half of them are more than 


one. It follows that |S] = 2°. 
Now 10! = 28 . 34 . 52 . 7 = 256 - 81 - 25 : 7. There are only two fractions below one 
with 5 in the numerator: 
25 25-7 
and ; 
256 - 81-7 256 - 81 


D> A 
for, already 25-81 > 256-7. Thus, the probability in question is a 


Solution 2 
Amit Itagi 


ab = 10! = 28 . 34 . 52 . 7 = 256 - 81 - 25 - 7. For ; to be in the lowest form, there should 
be no common prime factor of a and b. Moreover, a < b. Thus, 
a € {1, 256, 81, 25, 7,256 - 7, 81-7, 25-7}, 


10! 
with b= ae The cardinality of the set is eight and the number of elements divisible 
a 


1 
by 5 is two. Hence, the required probability is T 


Random Arithmetic Progressions 
Solution 1 
n 
There are (5) ways to choose four numbers out of n. This is the size of our sample 


space. Not all five-term sequences admit an arrangement into an arithmetic progression. 
Those that do can be identified by two numbers, say x and y, playing the role of the 
smallest and the largest terms of the progression. x and y need to satisfy x = y (mod 4). 


130 Cut the Knot: Probability Riddles 


Let N(n) be the number of such pairs. Thus, the probability of getting a five-term 


N 
arithmetic progression equals P; = (nr) 
n 


If five numbers {ao,@1,d2,a3,a4} form an arithmetic progression, then so do 


{ao, a1, a2}, {a1, a2, a3}, {a2,a3,a4} and {a1, a3, a5}. In other words, out of 10 = (3) 


possible selections of three elements, only four amount to an arithmetic progression. 
Combining the two estimates in one, the probability in question equals 


4 
Tegma 
n 
6) 
-1 —4 
If n = 4m, N(4m) = 4- e = BSS) because there are four remainders 


2 8 
modulo 4 and, for each, m terms that can be paired with other m — 1 terms. 
Now if n = 4m — k, where k = 1,2,3, define n! = n +k. As before, there are 


1 / 
—4 
a pairs of numbers, but several have been just added. So to correct the mishap, 
It 1 
8 4 
_ (n+k)(n+k-—4)-— 2k(n+k-— 4) 
E 8 
_(n+k-4(n-4)_ n?—4n 4k-k? 
j 8 are: 8 
> n(n — 4) 
2 3 ; 


—4 
because k < 4. Thus, in any event N (n) > ae implying 
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Solution 2 
N.N. Taleb 


n! 
Denominator: There are ——z Permutations of subsample S = {1,...,n}. 


(n—5) 
Numerator: We look for € = ar 


General solution for 5: We have arithmetic sequences (five members) separated by 
¢ = {1,2,...,€}, such that the sequence will be expressed as a1, a1 +Ç, a1 +2¢...a, + 
4¢, and constrained by a; + 4¢ <n. 

We can show that there are n — 4¢ possible increasing sequences of progression of 
difference ¢. 

There are 5! permutations for any five selected numbers. The number of sequences is 


E 
So (n= 46) = 2E + n€ — 2€. 
¢=1 


The probability of having a progression becomes 


_ BI (—2€? + né — 2) 
E n! 


(n — 5)! 


For large n or values of n divisible by 4, 


15 
(n — 3)(n — 2)(n—1)’ 


p= 


which is greater than 


1 = 6 
n—=1\ (n—3)(n—2)(n—1) 
3 
General solution for 3 then 5: Conditional on having the draws with arithmetic 
sequence {X(1), X,2),---,X(5)} in various permutations of 6-3) = 60 counts of the 


first three draws, how many include a sequence? 

We have six counts for {X(1),X2),X(3)} (that is 3!) plus six counts 
of {X(2),X(3),X4)}, plus six counts of {X(3),X(4),X(5)}, plus six counts of 
{X (1), X(3),X(5)} for a total of 24 counts, so the correction is 


24 


PX 6: 


Red Faces of a Cube 


Solution 1 


All faces are equiprobable, making the sought probability equal to ne 
n n 
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Solution 2 
Amit Itagi 


For n = 1, the probability is 1 as all faces of the cube are painted. 
For n > 1, let f(k,n) be the number of unit cubes that have k faces painted: 


(n— 2), fork =0 

6(n — 2)”, fork=1 
f(k,n) = $ 12(n—2), fork =2 

8, fork =3 

0, for4<k<6. 


Thus, the required probability is 


6 
P(n) = 5 P(red face up|cube has k faces painted)P(cube has k faces painted) 


k=0 
6 
aS yee 
ranean n3 
1 
a [6(n — 2)? + 24(n — 2) + 24] 
at 
=<. 


I was wondering how we got such a nice-looking answer. There are a total of 6n3 
faces, out of which 6n? are on the outer surface. Therefore, choosing a random cube 
and a random face is equivalent to choosing a face randomly from the 6n° possibilities. 


Solution 3 
Joshua Zucker 


Think of the cube as a conglomeration of n? columns of 1 x 1 x 1 cubes stacked on 
top of each other. Of the n small cubes in each such column, only one has its top face 
painted red. 


Red and Green Balls in Red and Green Boxes 


Let g stand for the number of green balls in the red box. Then, since the green box 
ought to have at least one green ball, we have the following distribution of colors: 


| Red Box (5 Balls) | Green Box (9 Balls) | 
g green 8 — g green 
5—g red g+1red 


Therefore, the number of green balls in the red box plus the number of red balls in 
the green box equals g + (g + 1) = 2g +1. 

Note that, since 0 < g < 5, 1 < 2g + 1 < 11. The only non-prime odd numbers in 
this range are 1 and 9; for these, g = 0 or g = 4. The probability of drawing all five 
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red balls or just one red ball is given by 


OOG) ore as 


(5) ~ 9002 1001 
5 


Probability of Equal Areas on a Chessboard 


First of all, we shall prove Nathan Bowler’s claim. 
A rectangle with sides parallel to the sides of a chessboard covers equal areas of 
both colors if and only if one of the two conditions holds: 


e One of the sides of the rectangle is an even multiple of the side of a chessboard 
square. The dimension of the other side is irrelevant. 


e The center of the rectangle lies on a grid line. 


For simplicity, let us call any segment double a square’s side in length even. 

The sufficiency (the “if” part) is pretty obvious. If, say, the vertical dimension of 
the rectangle is double a square’s side, then in every vertical cross-section the rectangle 
segments of different colors add up to the same lengths. If the center of the rectangle 
lies on a grid line, then, by symmetry, the statement is also true. 

Observe that every rectangle can be split into at most three parts: one with an 
even horizontal dimension, one with an even vertical dimension and one with both 
dimensions less than two. 

The problem of showing the necessity, i.e., the “only if” part, is thus reduced to 
showing that, in a rectangle with sides less than two, there is more of one color than 
the other, provided the center of the rectangle does not lie on a grid line. Here are the 
three prototypical possibilities: 


BE- 
Vy Vy 


1 1 vy 
% »2 y2 


x4 Xo 


In all cases, 
0 < z1,£2 < 1, and 
0<m,Yyo <1. 
Also, 


0< zı +z2< 1 


in the first case, and 


0< yi +y2< 1 


in cases 1 and 2. 
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If A, and Ag are total areas of the regions of the light and dark colors inside the 
rectangle, then, in the first case, we get 


Ay = T1 + £2 + Y1 + Y2, 


Az = (£1 + £2)(y1 + y2) +1. 


The difference of the two is 
Ay — Ag = — (£1 + z2 — 1)(y1 + y2- 1), 


which is never zero. 
In the second case, we get 


Ay = £1Y1 + T2 + T1Y2 = T1 (Y1 + yo) + T2, 


Ag = oy) + T1 + T2Y2 = T2(Y1 + Y2) + z1. 
The difference of the two is 


A, — Ag = (z1 — £2)(y1 + y2 — 1), 


which may be zero if and only if zı = £9, i.e., in the excluded case where the center 
of the rectangle lies on a grid line. The last case is treated similarly. 

Due to the fact that the probability of a random rectangle having its center on a 
grid line is zero, the riddle has an unexpected solution: for a rectangle with an even side, 
the probability of covering equal white and black areas is one; for the less fortunate 
rectangles, the probability is zero. 

As having an integer, let alone an even, side has the probability of zero, zero is the 
answer for the first case. 

For the second case, the sides of the rectangle may be independently odd or even. 
There are several interpretations. 

One answer is that in three out of four possibilities at least one side is even, making 


the probability of having equal areas - 


However, this is a simplified treatment of the riddle. The availability of even and 
odd side lengths may depend on the distance of a selected corner to the sides of the 
chessboard. 

On an 8 x 8 board, integer rectangles may be of size M x N, where 1 < M,N <7. 
Thus, for M = 2, 4,6, there are seven rectangles with at least one side even, 21 in all. 
For each of M = 1,3,5,7, there are three rectangles with at least one side even, 12 in 
all. Therefore, the total number of integer rectangular sizes that cover equal white and 
black areas is 21 + 12 = 33. This is out of the total of 7 x 7 = 49 possible sizes, giving 


33 
th bability of —. 
e probability o 19 


If we count the frequencies of various rectangle sizes, the answer changes again. 
The sizes M x N are tabulated separately depending on odd/even combination of side 
lengths (the tables are by Thamizh Kudimagan): 
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M/N 1 3 5] 7 

1 64 48 | 32 | 16 

3 48 36 | 24] 12 

5 32 24; 16| 8 

7 16 12 8| 4 
Total : 400 

M/N 1 3 5] 7 

2 56 42 | 28) 14 

4 40 30 | 20 10 

6 24 18 | 12] 6 

8 8 6 4| 2 
Total : | 320 x 2 = | 640 


M/N 2 4 6| 8 
2 49 35| 21) 7 
4 35 25] 15] 5 
6 21 15 9/3 
8 7 5 3] 1 
Total : 256 
Thus, an integer rectangle covers equal areas of black and white in 896 = 640+ 256 
896 
out of 400 + 896 = 1296 cases, i.e., with the probability of 1296 ` a 


Random Sum 


Solution 1 
The maximum possible sum is 1+2+---+10 = 55. 45 = 55—10. It is easier to calculate 
the probability that the sum is at most 10. This would be the sum of numbers written 
on the coins that fell face down. Just by inspection, there are 43 combinations of 
numbers from 1 to 10, each taken at most once, that sum to at most 10. 

The sample space consists of 2” possible outcomes, making the desired probability 


43 
equal to an" 


Solution 2 
Mike Lawler 


Sequence A000009 from the On-Line Encyclopedia of Integer Sequences lists, for 
a given index N, the number of partitions of N into distinct parts: 


10 


N 012 3 4 5 6 7 8 9 
1 1 122 3 4 5 6 8 10. 
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For example, 


7=74+0=641=54+2=4+43 
=1424+4 


8=8+0=74+1=64+2=54+3 


=1424+5=1434+4 


Summing up the 11 numbers in the second row of the above table gives 43. 


Solution 3 
Robert Frey and, independently, N.N. Taleb 


The expansion of the generating function II (1 +q”) where S is the set under consid- 
nes 
eration (here 1,2,...,10) into polynomials, such as a1q+a2q?---+amq™, will produce 
coefficients that equal the number of distinct partitions of m. 
10 
We are considering values 45 < j < Soi = 55: 
i=1 
10 
Il a+) = që gët +g +29q°2 +2¢°! +39 +4949 +598 +692" +8q2°-+10q2° +: , 
i=1 


with coefficients {1, 1, 1, 2, 2, 3, 4, 5, 6,8, 10} that sum to 43. The total number of tuples 


43 
under consideration is 21°, Hence the probability is 510 © 041992. 


Probability of Equilateral Triangle 
Amit Itagi 


Let ®(a) be the angle subtended by a chord of length x at the center. ®(R) = 60°. 
Let us work under the assumption that a Æ R. This also implies that a Æ b for, if 
a = b, then the only way 6(R) + ®(a) + (b) = 180° is if a = b = R. Without loss 
of generality, let a < R < b (one of a and b has to be smaller than R and the other 
greater than R because ®(a) + (b) = 120°). Let ®(a) = a and ®(b) = 120° — a. The 
only additive combinations of the ®s that result in 120° are 2®(R) and ®(a) + (b). 
Thus, the allowed permutations of {R, R,a, a,b, b} to get us an equilateral triangle are 
the permutations of (R, R, a,b, a,b) where permutations are allowed within each of the 
two (a,b) units and the entire set can be permuted in a cyclic manner. 
The total number of permutations of the set is 


6! 


Koo 


Protal a= 
with each two in the denominator discounting the permutations within the same kind 
of chords. The number of valid permutations of the kind (R, R,...) is 2x2 = 4 (two per- 
mutations within each (a,b) unit). Six cyclic permutations can be generated from each 
of the four permutations. Thus, Pyaliq = 4 x 6 = 24. Hence, the required probability is 

Pyatia 24 


4 
"o9 = Boal 90 15 
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Shelving an Encyclopedia 


Solution 1 to Question 1 
Vincent Pantaloni 


All said, there are 26! ways to randomly shelve 26 volumes. If A and B are considered 

as an immutable pair, like being glued together, the whole of 25 volumes could be 

placed randomly in 25! ways. Thus, the probability of having volumes A and B next 
1 


to each other in the right order is = =e 


Solution 2 to Question 1 
Pair AB could be in any of 25 positions while the remaining 24 volumes can be per- 


25- 24! 1 
muted arbitrarily, giving the probability of -a Be 


Solution 1 to Question 2 


Employing the reasoning from the first solution to the first question, glue volumes A, 
24! 1 
B and C together to obtain a collection of 24 books. The answer then is zl 26.25 


Solution 2 to Question 2 
Employing the reasoning from the second solution to the first question, the sequence 
ABC can be located in any of 24 places. The other 23 volumes can be permuted 


24 - 23! 1 
arbitrarily, giving the probability of 7 = = 96.95" 


Solution to Question 3 

Let A(n) denote the set of permutations of N,4, = {1,2...,n,n + 1} that do not 
contain two successive numbers next to each other in the right order, i.e., the permu- 
tations with no substrings k(k + 1), whatever k, and let a(n) = |A(n)| be the number 
of elements of A(n). (This is sequence A000255 from [85].) Note that we are looking 
for the number (n + 1)! — a(n), (n = 25) of permutations that do contain such pairs of 
successive numbers. 

A permutation 0,41 € A(n) can be associated with a number of permutations from 
A(n—1) when n+1 is excluded. Permutations from A(n—1) with n last can be looked 
at as members of A(n—2) and n = 1 can be properly inserted in any of n— 1 positions, 
0,1,...,n—2, which produces (n — 1) -a(n — 2) permutations from A(n). On the other 
hand, given a permutation from A(n — 1) with n in, say, position k # n, the term 
n+ 1 can be inserted in n positions: 0,1,...,4-—1,k+1,...,n, leading to the total of 
n -a(n — 1) permutations. Thus, we are led to a recurrence relation: 


a(n) =n-a(n—1)+(n—-1)- a(n — 2), 


n! — a(n — 1) 


Thus, one answer to question 3 is 
n! 
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Given the formula for the number of derangements dn of n elements, it is straight- 
forward to verify that 


d 
aln- 1) = 2, 
n 
š x ý dn+1 
which gives another answer to question 3, 1 — T 
n-n! 


Loaded Dice | 


Solution 1 


Let p; be the probability of i, i = 1,...,6, coming up for the first die and q; be the 
probability of i coming up for the second. Assuming all the sums come up with the 


1 
same probability, the latter equals TT Then pıqı is the probability of the sum being 


1 1 
2 such that pıqı = IT Similarly, peqe = TI is the probability of the sum being 12. 
The probability of the sum being 7 is 


1 
cin 5 Pkdj > P1d6 + Pod 
k+j=7 


1 1 
_ Pı Pe _ Pı m Pe > os 
llpe lp 11 \pe pi 11 


by the AM-GM inequality. That is a contradiction. 


Solution 2 
6 + 
Let P(x) = 5 pix?! be a probability generating function for the first die, Q(x) = 
k=1 


6 
5 qix’! that for the second. Since pp, qp # 0, each of P(x), Q(x) is a polynomial of 
k=1 
degree five and, as such, has a real root. 
On the other hand, 


12 
1 1 g"-1 
P = Sod k-2 ay 
(@)Q@) =) a? if gaa 
k=2 
10 
The contradiction comes from the fact that the cyclotomic polynomial 5 x! does 
k=0 


not have real roots. 


Notes 

The two solutions are automatically extended to dice of different shapes, e.g., all Pla- 
tonic solids. However, the second solution would not work for a polyhedron with an 
odd number of faces, e.g., for a cube with a corner sawed off. 
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Loaded Dice II 


6 
Let P(x) = X pic’ be a probability generating function for the first die, Q(x) = 
k=1 


6 
5 qx’ that for the second. The polynomial coefficients are the “weights” for each of 
k=1 
the indices. Let us set pı = 0, meaning that the first die never comes up with 1 on 
top. Factor out x? from P and x from Q: 


P(x) = x? Po(x) = x° (p2 + psx + pax” + psx? + per’), 
Q(x) = tQo(x) = x(q + Geox + 9327 + ax? + q5x4 + gor"). 


Assuming that the outcomes 3,4,...,12 are equiprobable, 


Po(x)Qo(x) = iltetatt 42%) 


z? —1 1 (@®—-1)(2° +1) 
~ 10 z-1 10 a—1 

1 lte+a?4+a34+a4 142° 
10 5 T 


This means that there is a choice: 


1 
pı =0, P2, P3, Pa, Ps, Po = = 


1 


G46 = z q2 = 93 = q4 = q5 = 0. 


The second die is easily created by taking a regular cube and placing numbers 1 and 
6 on three faces each. Both of them can be simulated by a pair of regular icosahedra 
(each with 20 faces): one with 10 faces bearing 1 and 10 faces bearing 6, the other with 
numbers (or number of dots) 2, 3, 4, 5, 6 on four faces each. 


Dropping Numbers into a 3 x 3 Square 


Solution 1 to Question 1 
Missouri State Problem Solving Group 


In order for the sum of three numbers to be odd, they must either all be odd or exactly 
one number must be odd. There are five odd numbers available, so one row must have 
three odd numbers and the other two rows must have exactly one odd number each, 
with the analogous result for columns. Once we choose a row with three odd numbers 
and a column with three odd numbers, we have used up all five available odd numbers. 
There are 3 x 3 = 9 ways of choosing the row and column, 5! ways of determining 
where the odd numbers are, and 4! ways of determining where the even numbers are. 
This gives a total of 9 x 5! x 4! matrices with the desired property. 


There are a total of 9! matrices with entries from 1,2,...,9, so the probability we 
seek is 
9x5!x4! 1 


9! 14° 
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Solution 2 to Question 1 
Amit Itagi 


We have four even and five odd numbers. The parity combinations of three numbers 
that result in an odd sum are {O, E, E} and {O,O, O}. 

The only possibility is that two rows have two even numbers each and the third 
row has no even number. The same logic applies to the columns. Thus, we have to 
choose the two rows and the two columns. This can be done in 3 x 3 = 9 ways. The 
two rows and two columns set the positions for the evens. The remaining positions are 
for the odds. Thus, there are 4! ways of permuting the evens and 5! ways of permuting 
the odds. The total number of arbitrary permutations is 9!. Thus, the probability is 

P= 9. 4!-5! _ 1 


1 eee ere 


9! 14 


Solution 1 to Question 2 
Missouri State Problem Solving Group 


If there is a column and a row with three odd numbers each, there is no way to place 
three odd numbers on a diagonal. In any event, there is a unique odd number that 
is shared by the chosen row and column (of each of the three odd numbers). For a 
diagonal sum to be odd, this number needs to be on the diagonal. Only in one case 
out of nine are both the diagonal sums odd; this is when the common odd element is 
at the center of the matrix. The probability of this is 


Bix 4! 1 
9 1267 


Solution 2 to Question 2 
Amit Itagi 


For one diagonal to have an odd sum, two of the four even positions in the preceding 
question need to have the same row and column number. If those positions are (i,t) 
and (j, j), the remaining evens have to occupy (i, j) and (j,i) to satisfy the conditions 
in question 1. Now the only way the other diagonal sums to an odd number is if (i, 7) 
and (j,i) fall on the other diagonal and (i, i), (j, j) do not fall on that diagonal. Thus, 
{i,j} = {1,3}. The nine possible ways of choosing the even positions drops to one and 
the probability shrinks by a factor of nine to 
1 1 


panasi 
2? 9.14 126 


Solution 1 to Question 3 
Missouri State Problem Solving Group 


If the selected point is in one of the corner matrices then the corresponding diagonal’s 
sum is odd while that of the other is even. There are four corners and the probability 
of such an event is 


4x5lx4 4 2 


9! 126 63` 


Chapter 6: Solutions 141 


Solution 2 to Question 3 
Amit Itagi 


For the main diagonal to have an odd sum, we place the two evens at (7,7) and (j, j) 
as in question 2. However, to keep the sum of the other diagonal even, (i,j) and (j,i) 
need to be off that diagonal. Thus, {7,7} = {1,2} or {2,3}. Now instead of nine ways 
of choosing the positions of evens in question 1, we drop to two ways of choosing the 


9 
positions of evens and the probability drops by a factor of z However, we could as well 


have chosen the second diagonal to give an odd sum instead of the first. This doubles 
the probability. Thus, 


Probability of Matching Socks 


Solution 1 


To start with, instead of looking for a matching pair, let us find the probability that 
both socks are red. 
The probability of getting one red sock is 


r 
r+b+g 
Assuming that the first sock is red, the probability of getting a second red sock is 
r—1 
r+b+g-1 
When it comes to calculating probabilities, colors do not make much difference: an 


analogous argument applies to blue and green socks, implying that the probability of 
getting two blue socks is 


b(b — 1) 
(r+b+g)(r+b+g-1) 
and that of getting two green socks is 


g(g — 1) 
(r+b+g)(r+b+g-1) 


The answer to the question is then 


r(r—1) + 0(b— 1) + g(g—-1) 
(r+b+g)(r+b+g-1) ` 


The formula is eminently reasonable for, if any of the three numbers is 1, the effect 
of having an unmatched sock in a drawer is to increase the denominator without adding 
anything to the numerator. 
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Solution 2 


b 
There are (" T 5 a ") ways to select two socks out of the total of r +b + g. There are 


(5) ways to select two out of r red socks. Therefore, the probability of selecting two 


MON 
(9) 


red socks is 


P(2 red socks) = 


E r! 2!(r+b+g-2)! 
~ Nr — 2)! (r+b+g)! 
r(r —1) 


~ (r+b+g)(r+b+g-1) 


This is the same result as before, and it is similar for the other colors. 


Numbered Balls Out of a Box 


Solution 1 

The average of two integers is an integer if the two summands are of the same parity: 
either both are odd or both are even. These two are the largest and the smallest of the 
three while the middle one is uniquely determined by these two. 


1 
The number of ways to choose two even numbers out of 20 is ( À = 45. Similarly, 


there are 45 odd pairs. 
2 
The total number of possible selections of three balls out of 20 is a = 1140, 


giving the required probability as 
45+45 9 3 


1140 114 38` 
If there are 21 balls, then the corresponding probability is 


10 P 11 
2 2 _45+55_ 10 3 
@ ~ 1330 133 ` 38° 
3 


Solution 2 

Let the total number of balls be N. Out of the three numbers chosen, let i be the 
middle number. If the smallest number is i — k, then the largest number has to be 
i+ k. Thus, for a fixed i, the number of ways of choosing the other pair of numbers is 
equal to the number of ways of choosing a valid k and is given by min{i — 1, N — i}. 
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Thus, the number of ways of choosing the balls satisfying the constraints is 


N 
N.= 5 min{i-1,N -¢} 
t=1 


eal N 
=a ae 
i=l i141 ¥] 


C= 14+ (20-i) 45+45 3 
P(2 — v= i= = 7 d 
e C (20,3) 140 38°" 
P(21) = D e e O _ 10 
C (21,3) 1330 133° 


P(20) > P(21). 


Planting Trees in a Row 


Solution 1 to Question 1 


Let us solve the riddle in general with m maple trees, o oak trees and b birch trees. 
There is a total T of arrangements of the three kinds of trees: 


_ (m+o+b)! 
is m! o! b! 
There are 
Ipa m+o _ (m+o)! 
o m!o! 


ways to arrange maple and oak trees and 


N= m+o0o+1\ — (m+ọ0o+1)! 
7 b ~ (m+o+1—b)!b! 


ways to place b birch trees so that no two are adjacent. The probability we are interested 
in is 


p WN __(mto)! (m+o+1)! m! o! b! 

— T mlo (m+o+1—bylb! (m+o+b)! 

= (m+ o)!(m+o+1)! _ __(mto+l)!  (m+o0)!b! 
~ (m+o+1—db)!\(m+o+b)! (m+o+1—d)!b! (m+o+b)! 


Oy) 
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Solution 2 to Question 1 
Amit Itagi 
First, order the maple and the oak trees (7! ways). Plant the birch trees in five of the 
possible eight locations (on either side of the planted trees or in the middle of two 
planted trees) for each birch tree (P(8,5) ways). Without restrictions, there are 12! 
ways to plant. Thus, the desired probablility is 

7!- P(8,5) T 


Pa= Tor = 99° 


Solution 1 to Question 2 


There is a total T = (m + o + b)! of ways to plant the trees. Maples and oaks can be 
planted in M = (m +o)! distinct ways, creating m+o+1 places for single birch trees. 


1)! 
tefi 7 unique ways. The probability we 


These can be planted there in N = lmr otd] 


are after is 


M-N (m+oọo)!-(m+o0+1)! 1 
T | (m+o+1—b)!  (m+o0o+b)! 


_ (m+o)!:(m+o0o+1)!-b! 1 
(m+o+1—b)! “(m+o-+b)!-! 


(m+0o+1)!  (m+o)!b! 
(m+o+1—b)!b! (m+o+b)! 


m+o+1 
(=) 

mt+tot+b\- 
ye) 
Solution 2 to Question 2 
Amit Itagi 


This is the same as the distinguishable case other than the fact that the number of 
ways with and without the constraint both get scaled down by 3!4!5!—the number 
of permutations among the trees of the same kind. As a result, the ratio and the 
probability do not change. 


Notes 

Perhaps surprisingly, the two cases produce exactly the same result. No less surpris- 

ingly, the answer is a fraction of two binomial coefficients which are the staples of 

problems dealing with indistinguishable objects. As a matter of fact, the distinction 

between maples and oaks is a red herring [19]. The only quantity that matters is their 

total amount. However, the distinction between maples and oaks on one hand and 

birch trees on the other has to be accounted for in the solution. 

m+ot+b 
b 

the trees. The remaining trees create m-+o+1 spaces to be filled individually by those 

selected. 


There are a total of ( ) ways select “birches” among the whole bunch of 
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7 
The answer in both cases is 99° Finding it with the specific number of each type 


of tree makes that fact more surprising than it deserves to be. The general (algebraic) 
solution sheds some light on the commonality of the two cases. 


Six Numbers, Two Inequalities 


Solution 1 to Question 1 
A. Bogomolny 


Unlike the earlier Riddle [L9] on page|3| the two questions have different answers. 

For both questions, the sample space counts 6! elements. The task is to count how 
many of these satisfy both inequalities. What is important is how many combinations 
of the numbers adjacent to the inequality symbols satisfy the inequalities. For the first 
question, we multiply that number by 3! to account for the remaining three digits. For 
the second question, the factor is 2!. 

For the first question, if the first number is 1, there are 10 choices for the other two: 
(2,3), (2,4), (2,5), (2,6), (3,4), (3,5), (3,6), (4,5), (4,6), (5,5). Similarly, if the first 
number is 2, there are six variants for the other two numbers. Then there are three 
variants if the first number is 3 and only one if the first number is 4. In all, there are 
(10+6+3-+1)-3! combinations that satisfy both inequalities. The probability of this 
20-3! 1 


6. 6 


happening is 


Solution 2 to Question 1 
Leon Stein 


For any triplet, there are six ways to put the numbers in three spots, but only one 


1 
holds ordering. Thus, only = of total arrangements are valid. 


Solution 1 to Question 2 
A. Bogomolny 


6 
For the second question, there are ( ) ways to fill the slots around the inequal- 


ity symbols. For every choice of the four numbers, there are six ways to satisfy the 
two inequalities, e.g., (1,2,3,4), (1,3, 2,4), (1,4, 2,3), (2,3,1,4), (2,4,1,3), (3,4, 1, 2). 


6 
Thus, the total number of “successful” combinations is (‘) - 6-2, giving the sought 


5-6-2 1 
probability as oe aie 


Solution 2 to Question 2 
Long Huynh Huu 


Let us observe that the two inequalities are independent (Appendix[A) because swap- 
ping the digits in one pair does not affect the other. Thus, borrowing from the logic of 


1 1 1 
the first question, the answer is 5 x 35T 
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Six Numbers, Three Inequalities 


1 5 
The answers to the questions are z4 and vt respectively. 


Solution to Question 1 
A. Bogomolny 


For the first question, we choose four digits to place next to the inequality signs. These 
can be permuted in 4! = 24 ways, of which only one satisfies all three inequalities. We 


1 
conclude that the probability of this happening is — 


24° 
6 ! 
We can do a more complete counting: there are ( 5) = oa ways to choose four 
numbers out of six. The other two can be permuted in 2! = 2 ways. Thus, the sought 
a.. 6!2! 1 
probability is aiaigl T FP as before. 


Solution to Question 2 
A. Bogomolny 


For the second question, observe that if four numbers u,v, w, x fit the inequalities in 
the first question, i.e., u < v < w < x, then 


uULW>V<T 
U<xr>uvu<w 
vcw>u<2@ 
v<r>u<w 
worrucy. 


Thus, to every solution of the first question there are five corresponding solutions 
to the second. It follows that the probability of a random sequence satisfying the 


; EEEE s u 9 
inequalities in the second question is z 


Tying Knots in Brazil 
A. Bogomolny 


Solution to Question 1 


There are three ways to tie the loose ends below the fist: a + b, a & c, boc. 
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Two of them (a & b, a & c) lead to one long string so that the probability of that 


ti g 
event is x=. 
3 


Solution to Question 2 


Given four string ends, when one knot is tied, the remaining two ends are left to be 
automatically tied. There is no choice. 


‘ay 


$ 

$ 
$ 
d 
H 
= 
H 
1 
L 
a 
a 
o 
E 
F 
m, 
b 


4 
There are ( 3) = 6 ways to choose two strings out of two, but there is doubling of 


the count as any two pairs of strings are counted twice. All in all, there are just three 


distinct ways to tie the lower ends: a œ b +c d,a cbed, asd++bec. 


One of these creates two small rings with two knots each whereas two yield a single 


2 
ring with four knots. Thus, the probability of getting a long ring is -. 


Solution to Question 3 


Tying two knots above the fist leaves us with this configuration: 
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2 2 
long string, the end a needs to be tied. There are four ways to accomplish this. Joining 
it with either b, c, d or e will leave one end loose. Say, if a is tied to b, c needs to be 
tied next. There are two ways to accomplish this. Thus, in all, there are 4-2 = 8 ways 
to tie two knots so as to form one long string. Thus, the probability of this happening 


1/5)\ (3 
There are = ( 5) ( ) = 15 ways to form two distinct pairs of knots. To obtain one 


equals j 
uals —. 
AE e 


Tying Knots in Russia 


Solution 1 to Question 1 
Six grass blade ends could be tied pairwise in 5-3-1 different ways (the first end could 
be tied with any of the remaining five; the second, with any of the remaining three). 
This is true both for top and low ends, giving a total of 15 x 15 possible outcomes. 
The question is how many of these have a favorable outcome, i.e., result in a single 
ring? Let us enumerate the blades 1, 2, 3, 4, 5, 6, and assume that, at the top, 1 is 
tied to 2, 3 to 4 and 5 to 6. At the bottom, 1 may be tied to 3, 4, 5 or 6; assume it is 
tied to 3. Then 2 may be tied to 5 or 6. In all, there are 2 x 4 different ways to tie the 
bottom ends for each of the 15 different knot combinations at the top, giving 15 x 8 
combinations that produce a single ring. Thus, the probability of this happening is 
120 8 


Solution 2 to Question 1 
From Riddle we know that the ends of five strings are tied into one long string with 


8 
the probability of I5 Add an additional string to tie a knot at the top and another at 


the bottom to obtain a complete ring. This shows that there is a 1-1 correspondence 
between arrangements of five strings that result in one long string and the arrangement 
of six strings that results in a single ring. 


Solution to Question 2 


For 2n grass blades, the top strings can be tied in (2n — 1)!! ways, where 
k! =k-(k—2)-(k—4)...X, 


where X = 1 if k is odd and 2 if k is even. Thus, there is a total of ((2n — 1)!!)? 
possible knot arrangements. As in solution 1 to the first question, for each of (2n — 1)!! 
arrangements at the top, there are (2n — 2)!! knot arrangements at the bottom, giving 
the probability of getting a single ring as 

(2n— 1)! x (2n-— 2)!  (Q2n— 2)! 


-R ((2n — 12 ~ Qn—1)N 


Notes 


The bride-to-be may be lucky to see a single loop formed by the grass blades, but, 
in general, there will be more loops than one. The expected number of such loops is 
discussed elsewhere, Riddle on page [196] 
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Guessing Hat Numbers 


Solution to Question 1 


First, let us solve a more straightforward problem: 


Five students blindly choose a hat among six numbered 1,...,6. Each 
announces the largest and the smallest number he/she sees. What is the 
probability that each will be able to deduce his/her own number? 


As there are six hats and five students, exactly one of the hats is bound to be left 
over. The sample space for the problem consists of six elements, which could be the 
omitted elements or the sequences of numbers on the hats that the students are wearing. 

As a general remark, if the selected hats bear numbers a, < az < a3 < a4 < Gs, 
then the numbers a1,a2,a4,a5 will always be called out. (For example, a4 will be 
called by the wearer of the hat with as.) The question is whether the number a3 can 
be deduced with certainty. This will be the case when a4 — a2 = 2, i.e., when ag is 
tight between az and a4. This will happen if the omitted number is one of 1, 2, 5, 6; 
the ambiguity will occur when either 3 or 4 is left over. 

Thus, every student will be able to deduce his/her number in four out of six cases, 


4 2 
giving the probability of the correct deduction as o3 


For the question where students need to guess their numbers, at least four of them 
will always be able to deduce theirs, i.e., to guess them with the probability of 1. In 
four out of six cases, the remaining person will also be able to deduce his/her number. 
In two out of six cases, the wearer of a3 will have 1 : 1 chance to make a correct guess. 
The total probability then comes to: 

4 


be : 
6 62 6 


Solution to Question 2 


As in the solution to the first question, we first look at the case where all the students 
may or may not deduce the numbers on their hats. We consider n > 6 and, for k, 
several cases. Let P be the sought probability. 

If k = 1, P = Osince a single fellow has no way to deduce the number on his/her hat. 

If k = 2, P = 1 because two students call out the other’s number. 

If k = 3 or k = 4, P = 1 because all numbers will be eventually called out, and 
each student may pick the number he/she does not see. 

Therefore, we focus on k > 4. Let the numbers on the hats, in the increasing 
sequence, be 


ay < a2 < +++ < Ak—1 < Qk. 


Observe that the numbers a, and a, will be called out k — 1 times each. Numbers 
a2 and az_— 1 will be called out just once. If all k numbers are consecutive, i.e., if the 
above sequence has no gaps, then, as in the case k = 3 or k = 4 above, each of the 
students will be able to deduce his/her number. It will be the one he/she does not see. 

If there are gaps, some of the students will not be able to deduce their numbers. 

The gaps are absent if aķ-1 — ag = k — 3 so that between az and aķ—ı there is 
enough room for exactly k — 4 numbers. For az we must have 2 < ag < n — k + 2; if 
a2 = i > 2, then for a, there are i — 1 choices and for az, n — k — i + 3 choices. The 
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total number of sequences that allow the students to surmise their hat number is 


n—k+2 


XO (@-Din-k-t+3) 


i=2 


n—k+2 
= De (n—k+4)i—(n—k+3)) 
(n 


6 
g k+4)(n a k+3) (net jn kia) 
=(n—k+2)(n k+3)| n sh 2 4 on ik | 
o (n=k+3)(n-k+2)(n-k+1)_ (n—-k+3 
l - (1), 


It follows that the probability of all the students being able to deduce their hat 
number is 


is 
pe 


Now let us turn to the “guessing” problem. The latest formula is the key. Let 7 be 
the total length of the gaps in the sequence az < --- < ag_y. 1 < j < n— k. This is the 
same as the number of students who have no choice but to make a guess. For every j, 
there are 


(" ga H e aa 


such sequences, making the total equal to 


ee ae oe 


In itself, this sum does not make much sense because, for different 7, the probabil- 
ities of guessing right differ; in every case they are (j + 1)~4. However, the total can 
be verified manually. For example, as we saw, for n = 6 and k = 5, the only possible 
value for j is 1, so we have 


( aaa Ee 


as expected. For n = 7 and k = 5, there are two possibilities: 7 = 1 and 7 = 2. The 
formula gives 


()()+QG)-trr-9-m 
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Here are all 11 cases (an asterisk denotes the omitted numbers that ought to 
be guessed): 


1 2 3 x x 6 7  (j=2) 
1 2 « 4* x 6 7 (j = 2) 
1 2 x x 5 6 7 (j=2 
1 = 3 4 x 6 7 (j=l 
1- 3 Bese 6 7 (G=) 
-2 3 4&4 s 6 7 (=D 
-2 3 x 5 6 7 (j=1) 
Da oS eS i) 
1 2 s 4 5 -= 7 (j=1) 
1 2 3% x 5 6 — (j=l) 
1 2 x 4&4 5 6- OG SD 


The general formula for the probability of all students being able to guess correctly 
appears to be 


a 


G 


Probability of an Odd Number of Sixes 


If 0< k <n, the probability of k sixes turning up in a random toss of n fair dice is 


OJOO 


1 
Hence, with a = ° and b = z the required probability is 


OG) @ 


= sum of every second term in the expansion of (a + b)” 


-0 
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Naturally, P+ Q = 1. Why is Q > P? The graphics below may be suggestive. 


Zero is an even number always lurking in the background. 


Probability of Average 


Solution 1 
Vrund Shah 


To have a whole average, the largest and the smallest numbers of the three drawn need 
to have the same parity. If they do, the third number is determined uniquely. 


10 
There are e = 45 pairs of even numbers in the set 1,2,...,20 and as many 


pairs of odd numbers. This means there are exactly 90 ways to choose three numbers 
out of 20 so that one of them is the average of the other two. Altogether, there are 


20 
( 3 ) ways to choose three numbers, all equiprobable, giving the sought probability as 


pG) w. 


20 ~ 1140 38° 
3 
Solution 2 


Hélvio Vairinhos 


For each ball i = 2,...,10, the number of pairs which have i as their average is 


i — 1, which totals = 45. The same is true for i = 11,...,19, due to symmetry: 


i++ 20 — i. Hence, the total number of choices is 45 - 2 = 90, out of e 


Solution 3 
Amit Itagi 


Let the smallest number chosen be k. To satisfy the average condition, the other two 
balls need to be numbered k + d and k + 2d for some positive integer d. Moreover, 
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k 
k + 2d < 20. Thus, if k is odd, d € {1,2,...,10 — 5}; if k is even, d € {1,2,...,10— 


k+1 
="). Thus, the required probability is 


1 k+l k 
Pa ips 10- Ë 
O(20, 3) 2 (w 2 )+ De (w =) 
kE{1,3,5,...,19} k€{2,4,6,...,20} 
2 10 
ee SG (io 
C(20, 3) > m 
2.6-45 3 
=a g I 


Solution 4 
Joshua B. Miller 


The number of overlapping triples (i — 2k, i — k, i) 
1. for k=1,...,9 

2. for k = 1, i € [3,20], so the number is 18 

. for k = 2, i € [5,20], so the number is 16 


3 
4. ... and so on 
5 


. for k = 9, i € [19,20], so the number is 2. 


-1 
The number of overlapping triples is 2- a = 90 while the total number possible 


is @: 


Marking and Breaking Sticks 


For the case of equal pieces, one of the marks ought to be located on one of the pieces. 
The second mark is located on the same piece with the probability of 

For the random lengths, imagine that first n — 1 marks are placed at the break 
points, making the total number of marks n + 1. There are (" a ‘) ways to pick two 


marks out of n+ 1. Of interest are those in which two original marks follow each other, 
with no “break” marks in between. In other words, if the marks are numbered from 
1 through n + 1, we are interested in the distributions where the original marks bear 
successive numbers: 1 and 2, or 2 and 3,..., or n and n+ 1. There are n such cases 


1 
(out of u D implying that the sought probability is 


Cog (n+1)! n+1 
2 


n n-2!(n—1)!_ 2 


154 Cut the Knot: Probability Riddles 


This shows a rather remarkable increase compared to the case of equal pieces. 


Bubbling of Sorts 


Solution 1 to Question 1 

In order not to be pulled down, a number needs to be larger than the ones preceding it; 
to be drawn up, it also needs to be larger than some following it. To get into position 
#30, a number needs to be larger than the previous 29 terms while smaller than term 
#31. If we denote the two numbers r3o and r31, then this means that rz, is the largest 
of the 31 numbers and r3o is the next largest. There are 31! permutations of 31 numbers 
and 29! permutations that fix two numbers out of 31. The probability is thus 


29! 1 1 


P=% = =—. 
31! 30-31 930 


Solution 2 to Question 1 


To get into position #30, a number needs to be larger than the previous 29 terms while 
smaller than term #31, but nothing makes ro especially remarkable. The largest of the 
first 30 numbers could have been in any position, not necessarily position #20. Thus, 


1 
the probability that rao is the largest among the first 30 numbers is 30° Similarly, The 


number in position #31 that stopped the advancement of r2ọ is the largest among the 
first 31, while the probability of the largest number among 31 to be in the position 


1 
#31 is 31° The two events being independent, the probability of both happening is 


1 1 
30-31 930° 


Solution to Question 2 

On the first pass, as the advancement of rəọ was stopped by r31, the latter could have 
been up on the next step so as to bring into position #31 a number smaller than rag 
(the one that is in position #30 after the first pass). For this not to happen, the number 
r32 needs to be larger than r2ọ. Thus, the sample space consists of 32! sequences in 
which one position is fixed and two are interchangeable, making the probability for the 
second question 


291-2 _ 2 © 1 
32! 30-31-32  14880` 


Solution to Question 3 
Christopher D. Long 


For n bubble passes, r2ọ, in order to stay in position #30, needs protection of n 

numbers, each larger than r2ọ. These are located in positions 31, 32,...,30 +n. Their 
29!- n! 

(30 +n)! 

After 10 passes (or less) the terms in positions 31—40 will be correctly ordered. If 
the same is true (which is not assured) for positions 1-29, the algorithm will stop. 
If it does not, it will continue to reorder the terms in those lower positions without 
affecting r2ọ in position #30. 


order is inconsequential. This gives the probability P = 
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Notes 


Antonio Catalano and later Mathew Crawford have interpreted the second question 
as independent from the first: 


Assume the first question was never asked. Then what is the probability 
that the number that begins as r2ọ will end up in position #30 after two 
bubble passes? 


We might as well ignore positions 33-40. Relabel 1-32, for convenience, 
with 30 in spot 20 and either 31 or 32 in spot 32. Bubbling small numbers 


2 
backward from the end helps perspective. The answer is 39.31" 
Probability of Successive Integers 

86 
A 5 
The answer is 1 — ~—— #& 0.2076. 
90 
5 


The tricky part is counting the number of ways to choose five non-adjacent num- 


Solution 1 
Josh Jordan 


bers from 10 through 99. Below I show that this is (*) using a 1-1 correspondence 


involving adding/removing padding to bit-strings. 

For simplicity, it is helpful to first subtract 9 from each of the numbers, so they 
range from 1 through 90. 

Choose five non-adjacent numbers from 1 through 86. Write them as a bit string 
in which there is a 1 in position N just when N was chosen. 

For example, the choice {2, 3, 6,8,86} would look like this: 


(86 bits) 
SOO TO 
0110010100...001. 


Now add a zero between each pair of ones: that will be four additional zeros. The 
string becomes the following: 
(90 bits) 
61010001001000...00 
The second string represents five numbers chosen from 1 through 90 with no adja- 


cent numbers. To convert any such string to a selection of numbers from 1 through 86, 
remove a zero between each pair of ones. 


Solution 2 
Amit Itagi 
First, consider the event that no two chosen numbers are adjacent. Let the five chosen 
numbers sorted in increasing order be a; (j = 1,2,...,5). Define five groups of consec- 
utive numbers: G = (a1,...,@2 — 1), Go = (a2,...,a3 — 1), ..., G4 = (a4,..., a5 — 1) 


and G's = (as,...,99). Note, numbers less that a; do not fall in any of the five groups. 
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Lump those numbers in a group Go. For no two of the five numbers to be adjacent, the 
groups G;—G need to have at least two elements. G5 will have at least one element, as 
as < 99, and Go can have no element. If N; is the number of elements in group G;, the 
number of ways of choosing the numbers under the constraint is the number of solutions 
of No + Ny + No +---+ N5 = 90, such that N; >2for0<i< 5, Ns > 1 and No >0. 
This is equivalent to the number of solutions of No + N1 +N2 +- --+N5 = 90—4+1 = 87 


under the modified constraint N; > 1. The number of solutions is & 7 i) = y 


Thus, the required probability is 1 — ~—— 


Solution 3 
Stephen Morris 


Count the choices that fail. These all start with even numbers. Starting with the highest 
and working down, randomly decide whether to increment it and all higher numbers. 
If the highest is 96 or 98 we can choose to increment max 3 or 1 times: 


4 4 44 
PERIME NEL 
90 l 
5 
106081 
511038" 


The exact result is 


Solution 4 
Christopher D. Long 


We can use an urn model. Let x2,...,%5 > 1, xı > 9, zę > 0 and z1 4+---+275+2%6 = 

99—5 = 94. Our numbers y; for i = 1, 2,3, 4,5 equal y; = 41 +---+a;+7. The number 
86 

(5) 


(s) 


4—4— 
of ways to assign these values is C ae ’) = & 


5 S); hence p= 1- 
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Conditional Probability 


How dare we speak of the laws of chance? Is not chance the antithesis 
of all law? 


Joseph Bertrand, 1889, Calcul des Probabilités 


Riddles 


7.1 Two Liars 
Problem 2.4.9] 


1 2 
A and B tell the truth with the probability of 3 and lie with the probability of z 
A makes a statement, and B confirms that the statement made by A is true. 


What is the probability that A is actually telling the truth? 


7.2 Three Liars 
Problem 2.4.10] 


1 2 
A, B and C tell the truth with the probability of 3 and lie with the probability of z 


A makes a statement and B makes an observation about whether A’s statement is 
true or not. C tells us that B confirms that A is telling the truth. 
What is the probability that A is actually telling the truth? 


7.3 Taking Turns to Toss a Die 

pp. 59-61] 

Three players—A, B,C—take turns tossing a fair die, following the sequence 
ABCABCABC .... This continues until one of them comes up with a 6 on top. That 
player drops out of the game while the other two continue to take turns until one of 
them tosses a 6, at which point the game ends. 


e What are the probabilities for each of A, B,C to be the first to exit the game? 


e What are the probabilities of the exit sequences AB, BA and CB? 


7.4 Two Coins: One Fair, One Biased 
Problem 1989-4] 


1 2 
Two coins are given. One is fair: P(heads) = z The other coin is biased: P(heads) = 3° 


One of the coins is tossed once, resulting in heads. The other coin is tossed three times, 
resulting in two heads. 
Which coin is more likely to be the biased coin, the first or the second? 
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7.5 Probability of the Second Marble 
Problem 318] 


A bag contains marbles of two colors, black and white; all combinations of col- 
ors (including all marbles being of the same color) are equiprobable. You randomly 
(blindly) pick a marble from the bag. It is black. 

What is the probability that if you picked a second marble at random it would also 
be black? 


7.6 Quotient Estimates | 

Two numbers X and Y are randomly chosen from the interval [0,1]. Let r denote the 
max{ X,Y} 
min{X, Y} 
assumptions, there are three questions. For a given k > 0, what is the probability 
that r > k 


quotient of the larger of the two by the smaller: r = Depending on the 


1. when X is uniformly distributed on [0, 1] and Y is uniformly distributed on [X, 1]? 
2. when X,Y are jointly uniformly distributed on {(2,y): 0<a<y<1}? 


3. when X and Y are uniformly and independently distributed on [0, 1]? 


7.7 Quotient Estimates II 

[ri] 

n > 2 random variables X;, i = 1,2,...,n, are each uniformly distributed on the 
max {X;} 

interval [0,1]. If r = en what is the probability that r > k for a given k > 1? 
ep A 


7.8 The Lost Boarding Pass 


On a sold out flight, 100 people line up to board the plane. The first passenger in the 
line has lost his boarding pass but is allowed in, regardless. He takes a random seat. 
Each subsequent passenger takes his or her assigned seat if available or a random, 
unoccupied seat otherwise. What is the probability that the last passenger to board 
the plane finds his seat unoccupied? 


7.9 Lucky Contest Winners 


Students at your school have just finished competing in the qualifying round of a 
nationally sponsored contest on mathematical reasoning and sense making. When the 
work is scored, it turns out that four students at your school all have perfect preliminary 
papers—two girls and two boys. The school decides to hold a random drawing among 
these four students to select two of them to send to the national finals. The drawing 
takes place in the school auditorium. You show up late to the drawing, just as one of 
the winners—a girl—is leaving the stage amid cheers. 


1. Suppose that the girl you saw leaving the stage is the first winner. What is the 
probability that the second winner will also be a girl? 
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2. Suppose that the girl you saw leaving the stage is the second winner. What is 
the probability that the first winner was also a girl? 


3. You pass a door to the auditorium, peek in, see a girl descending the stage but, 
being pressed for time, close the door and continue on your errand. You do not 
know whether the girl you saw leaving the stage is the first or the second winner. 
What is the probability that the other winner is also a girl? 


4. Having missed the whole ceremony, you eventually call the principal and inquire 
whether there was a girl among the winners. The principal responds in the affir- 
mative. What is the probability that both girls were selected? 


7.10 Diminishing Hopes 
Problem 2, 4, 13a] 


A fellow’s desk sports eight drawers where he randomly (but with equal probabilities) 
stores his documents. This would be the whole condition of the riddle except that in 
two out of 10 cases the fellow simply forgets to store a document, and eventually the 
latter gets lost. 

When the fellow needs a document, he starts a search from the first drawer and 
proceeds sequentially until the document is found or it becomes clear (after checking 
all the drawers) that the document has not been stored in the desk in the first place. 

There are several questions: 


1. The fellow checks and finds no documents in the first drawer. What is the prob- 
ability that the document will be found in the remaining seven drawers? 


2. The fellow checks and finds no documents in the first four drawers. What is the 
probability that the document will be found in the remaining four drawers? 


3. The fellow checks and finds no documents in the first seven drawers. What is the 
probability that the document will be found in the last remaining drawer? 


7.11 Incidence of Breast Cancer 

In a study, physicians were asked what the odds of breast cancer would be in a woman 
who was initially thought to have a 1% risk of cancer but who ended up with a posi- 
tive mammogram result (a mammogram accurately classifies about 80% of cancerous 
tumors and 90% of benign tumors). Ninety-five out of a hundred physicians estimated 
the probability of cancer to be about 75%. Do you agree? 


7.12 A Search for Heads and Its Consequences 


Jack flips a coin three times and writes down the outcomes. Next, Jack circles one of 
those outcomes that is immediately preceded by heads. What is the probability that 
the marked flip is a heads, conditional on 


1. its being the second flip? 
2. its being the third flip? 


3. there being no additional information on the selection? 
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7.13 A Three Group Split 


If the integers 1 to 9 are randomly distributed into three sets of three integers, what 
is the probability that at least one of the sets contains only odd integers? 


7.14 Lewis Carroll’s Pillow Problem 


pp. 188-189], pp. 129-132] 


A bag contains a counter, known to be either white or black. A white counter is put 
in, the bag is shaken and a counter is drawn out, which proves to be white. What is 
the chance of drawing a white counter now? 


7.15 A Follow Up on Lewis Carroll’s Pillow Problem 
This problem was inspired by Chris Conradi’s letter. 


Suppose a counter is drawn at random from a bag containing b black and w white 
counters. Without revealing its color, that counter is placed in a second, empty bag. 
Then, a white counter is added to this bag. Now, you draw a counter from this second 
bag and it turns out to be white. What is the probability that the remaining counter 
is white? 


7.16 Sick Child and Doctor 
p. 48] 


A doctor is called to see a sick child. The doctor has prior information that 90% of sick 
children in that neighborhood have the flu, while the other 10% are sick with measles. 
Let F stand for an event of a child being sick with the flu and M stand for an event 
of a child being sick with measles. Assume for simplicity that F UM = Q, i.e., that 
there are no other maladies in that neighborhood. 

A well-known symptom of measles is a rash (the event of having which we denote 
R). P(R|M) = .95. However, occasionally children with flu also develop a rash so that 
P(R|F) = 0.08. 

Upon examining the child, the doctor finds a rash. What is the probability that 
the child has measles? 


7.17 Right Strategy for a Weaker Player 
Problem 316] 


A and B are to play a two game chess match. If they are tied after two games, they will 
continue until the first win. A is the weaker of the two players. With a daring game, 
A’s probability of winning is 45% and of losing is 55%; with a conservative game, A’s 
probability of a draw is 90% and of losing is 10%. Which of A’s strategies could lead 
to his winning the match? With what probability? 


7.18 Chickens in Boxes 
Four boxes are arranged in an L shape, Figure [7.1] 
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e peN 


Figure 7.1: Four boxes. 


Each box has only enough room for a single chicken. Each of the boxes has the 
1 
probability of = to receive a chicken, subject to the requirement that exactly one of 


the vertical boxes (1, 2 or 3) contains a chicken and that exactly one of the horizontal 
boxes (3 or 4) contains a chicken. 


1. What are the chances that the corner box contains a chicken? 


2. Which of the boxes is most likely to contain a chicken? 


7.19 Two Chickens in Boxes 


Two chickens are randomly placed into four boxes that are arranged in an L shape, 


Figure 


Each box has only enough room for a single chicken. Each of the boxes has the 


1 
probability of 5 to receive a chicken, subject to the requirement that at least one of 


the vertical boxes (1, 2 or 3) contains a chicken and that at least one of the horizontal 
boxes (3 or 4) contains a chicken. 


1. What are the chances that the corner box contains a chicken? 


2. Which of the boxes is most likely to contain a chicken? 


7.20 Two Chickens in Bigger Boxes 
Two chickens are randomly placed into four boxes that are arranged in an L shape, 
Figure 

Each box has enough room for two chickens. Chicken placement is subject to the 
requirement that there be at least one nonempty vertical box (1, 2 or 3) and that there 
be at least one nonempty horizontal box (3 or 4). 


1. What are the chances that the corner box contains a chicken? 


2. Which of the boxes is most likely to contain a chicken? 
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Solutions 


Two Liars 


Solution 1 
A. Bogomolny 
Both A and B tell the truth in one of three cases and lie in two of three cases. The 
three cases for A and B can be visualized as a 3 x 3 square. 

B could claim that A’s statement is true if both were telling the truth or if both 
lied. There are five red squares that correspond to these conditions. In only one of these 
(the upper left corner) was A actually telling the truth (Figure [7.2). This happens in 


1 
one of five cases, i.e., with the probability of 5 


A tells 
truth 


A lies A lies 


B tells 


truth 


B lies 


B lies 


Figure 7.2: Two liars. 


Solution 2 
Christian Adib 


Let E be the event that A makes a statement and B confirms. This can happen in 
two ways: 

e T, both A and B are telling the truth; 

e L, both A and B are lying. 
11 m 2 2 5 
33 33 9 

Given that we observe E, we are interested in finding the probability that the 
underlying process is in fact T, that is, P(T|E). From the formula for conditional 


P(E) = P(T) + P(L) 


P(TQAE 
probabilities, P(T|E) = a and P(T N E) = P(T) because T C E. In 
conclusion, 
1 
P(T) 9 1 
P(T|E) =>; rn. 
(TIE) = Boy = 5 8 
9 
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Three Liars 


Solution 1 
A. Bogomolny 


The tree shown in Figure [7.3] represents all possible cases wherein C tells us that B 
confirmed A’s statement. 


C tells 
truth 


B tells 
truth 


Atells B lies 
truth 


C lies 


B tells Selos 


truth 


C tells 
truth 


B lies 


Figure 7.3: Three liars. 


Counting the probabilities gives the total, 


pal 1 1 2 2\ 2 /12 21 
-3 83 3'3 3) 3 (3 3'3 3 


_ 1b 
OT 
This corresponds to 13 cases out of 27 wherein C tells us that B confirmed A’s 


statement. The first addend in the equation above corresponds to 1-1+2-2 = 5 cases 
wherein A’s statement was indeed true. Therefore, with the information at hand, we 


5 
m : 27 5 
may conclude that the probability that A was telling the truth is BTB 
27 
Solution 2 
Amit Itagi 


1 2 
Let t= z and f = z. 


164 Cut the Knot: Probability Riddles 


A Speaks | B Speaks | C Speaks C Can | Probability 

Truth Truth Truth Make the 
Statement 

0 0 0 0 f? 

0 0 1 1 tf? 

0 1 0 1 tf? 

0 1 1 0 tef 

1 0 0 1 tf? 

1 0 1 0 tf 

1 1 0 0 ty 

1 1 1 1 t 


P(A speaks truth|C can make the statement) 


_ P(A speaks truth N C can make the statement) 
7 P(C can make the statement) 


tf? + +f? 1? +22 5 


T 3f 2 43f2 1243-22 13° 


If the properties in the first four columns of the above truth table are labeled by 
Pk, where k is the column number, then the above truth table gives P4 as 


Pj, Po. Py: Pp. Py Py + PP Pe Pi. P.P 


using logic gates or as 
((4P1) A (Pa) A P3) V ((4P2) A CPs) A Pi)v 
((>P3) A (P1) A P2) V (Pi A Po A P3) 
using propositional logic. 


Taking Turns to Toss a Die 


Solution 1 

Let P4, Pg, Po be the probabilities of A, B,C, respectively, being the first to exit the 
game. Concerning P4, when it is A’s turn, he either tosses 6 and the game stops or all 
three toss anything but 6 and the game continues: 


1 /5\’ 
P=- -] P 
A =+) A; 


from which P4 = n For B to be the first to exit the game, A needs to miss 6 followed 


by B tossing 6. If this does not happen and they all (in the BAC sequence) miss 6, 
the game continues: 


5 1 5\3 
Pees OP 
B= =+ (3) B, 
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from which Pg = For Po, we have 


5\? 1 5\3 
Poe=(?) ahi hes | P 
Bea) ear ae 


25 
which resolves to Po = 91° As a confirmation, PA + Pg + Po =1. 


With A out of the game, the sequence of throws becomes BCBC .... If pı is the 
probability of B being the first to toss a 6 then, as before, 


1 5V 
PSs Š P. 
1 z+ (5) l; 


yielding pı = > Obviously for C to get ahead of B the probability p2 = a These 


probabilities remain valid for a two person sequence, regardless of their identities. 
Observe that when, e.g., B exits first, the game proceeds in the sequence CACA.... 
Thus, 


36 6 216 
ABT A or i= 00 
30 5 150 
A ES of L I0 
25 6 150 
PAREEN E IIL 
25 5 125 
POR he PS E T 1001 
36 5 180 
PAC Ee TES A Tl 
30 6 180 
SBOT EACE S 00 


It can easily be checked that all six add up to 1, as they should. Perhaps surprisingly 
Pga = Poa and Pac = Pgo. 


Solution 2 
Amit Itagi 
Let A; be the event that A is the kt? person to exit (with a similar notation for B 
and C): 
3 6 l 
1 5 5 6 36 
P(A)=- |1 = = Se 
a= f+ (8) + (8) + fee 
(a) 
5 30 
P(B,) = | =) P(4) = = 
B) = ($) Pan = i 
5 25 
P = | - ] P(B) = =. 
(1) = (2) PB) =2 
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P(B2|A1) = P(C2|B1) = P(A2|C1) 


1+ (2) +(2) + 
ea 


P(C|A1) = P(Aa|B1) = P(Bo|C;) = 1— P(Ba|A1) = =. 


1 


6 


11 
Thus, 
P(A, N B2) = P(Bo|A1)P(A1) = (=) (=) z n 
P(A 0C) = P(CaAx)PAn) = ($) (5) _ ay 
P(B, N Cy) = P(C2|B,)P(B,) = (5) (#) _ 
P(By N Az) = P(A2|B1)P(B1) = (=) (=) T nor 
P(C1N A2) = P(Aa|C1)P(C1) = (=) (=) Tae 
P(C, N B2) = P(Be|C1)P(C1) = (=) @ r aaa 
Solution 3 
N.N. Taleb 


The probability of the event occurring at period i is p(1 — p)*~+. Let P(.) be the 
probability that player A, B or C exits first. 
Skipping three for each, we get 


1 _ 36 
(p—3)p+3 91° 


P(A)= p(1—p)* = 
1=0 


= f l-p 30 
ic X _ 3i+1 _ = 


(p— 1)? _ 25 
(p—3)p+3 ~ Qt" 


P(C) =X p - p)” t? = 
1=0 


We note that events A, B and C are separable and exhaustive; hence, the proba- 
bilities sum to unity. This will help in what follows because every combination starting 
with A, B or C will entail P(A), P(B) or P(C), respectively. 
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Now we condition AB. Since we have the sequence BCBCBCBC... (after A) and 
need to skip two for each, starting with p 


(4) Spl -p = PAs = 


We also condition BA. Since we have the subsequent sequence ACACACAC... 
(after B), we need to skip two for each, but starting with p(1 — p) 


2 ; l—p 150 
B 1—p)***! = P(B)—— = —_.. 
DD p) ( lJa 1001 


As for CB, we have the subsequent sequence ABABABABABAB... (after C): 


i= 125 
1—p)*" = P(B n 
a ( ar p 1001 


Two Coins: One Fair, One Biased 


Solution 1 


Assuming the fair coin is the one that was tossed, the probability of it showing heads 
and of the other coin showing two heads is given by 


OG (3) =3 


On the other hand, if the biased coin was tossed once then the probability of the 
observed result is 


2 (NAVAS 2 
ae 2) 8 
telling us that, more likely, it was the biased coin that was tossed once. 


Solution 2 
Amit Itagi 


Without any a priori knowledge, let us assume that the first coin is equally likely to 
be either of the two coins. Let D denote the observation and B the event that the first 


coin is biased. Thus, P(B) = P(B) = > 
P(D|B) = 5 | -|E(3, 2) (3) G) A 
POB) = |3| -c62 ei G) 2 
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Using the Bayes theorem, 


Ae P(D|B)P(B) 
P(D|B)P(B) + P(D|B)P(B) 


Thus, the observation makes the first coin more likely to be biased. 


Solution 3 
Joshua B. Miller 


Prior (relative) odds are even, while the likelihood ratio is 


2 /1\° 2 1 
pp a =- (3) -38 
Pr(E|F,B) 1 /2\? 1 1 4 
9 
= 


Odds in favor of the first coin being biased and the second coin being fair are 9 : 8, 


9 
i.e. bability of —. 
i.e., a probability of 77 


Probability of the Second Marble 


Solution 1 
A. Bogomolny 


Assume the bag at hand contains n > 0 marbles. There are n + 1 possibilities for the 
number of black marbles, ranging from 0 to n; all n+1 are said to be equiprobable. Let 
Ng denote the event that there were k black marbles to start with. Let Bı denote the 
event that the first drawn marble was black; Bə is the event that the second marble 
was black. 
1 
Thus, the probability that the bag at hand carries i black marbles is P(.N;) = Te 
n 

for any i = 0,...,n, and the probability that a randomly drawn marble from that bag 


is black is P(B,|Ni) = = Then, by the Bayes theorem, the probability that the bag 
n 


from which a black marble was drawn has k black marbles is 


k 
_ O P(BIN PN) __ nat) _ k 
P(N;|B1) = So P(BiIN)P(M:) = = se ee 
i= n(n + 1) 


_ 2k 
~ n(nt+1) 
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Now, if the event Nz took place the probability of the second marble being black 


-—1 
i and the total probability of the second marble being black is 


7 2k k-1 2 7 
P= a a ea a 
2 LE 
~ n(n +1)(n—1) dk =k) 
E 2 (a +1)(2n +1) n(nt+ 2) 
~ n(n+1)(n— 1) 6 2 
1 /2n+1 1  2X(n-1) 

-= ( 3 =- 3 
o2 

3 


Note that the probability is independent of n. 


Solution 2 

There is a shortcut for evaluating P(.N;|B1). Since we do not know how many black 
marbles there are in the bag at hand, we should consider all n + 1 possible configu- 
n(n + 1) 


n 
rations. The total amount of black marbles for all eventualities is Soi = 5 ; 


i=0 


2 
making the probability of getting a black marble at all equal to ————~ and the 


n(n +1) 
probability of getting one from a bag with k marbles P(N;,|Bi) = daa From 
n(n 


here we proceed as in the first solution. 


Solution 3 
Amit Itagi 


Let D; denote the event that the it” draw is black and Bg denote the event that there 
are k black marbles out of a total of n: 


P(D1) = $ P(Di|Be)P(Bs) = 5 oe 
k=0 k= 


P(D2N D ) = ŞT P(Dy ND, B,)P(Be) = : ye — 


k=0 n+1 r n-—1 
P(D pie i 1 Doro k(k = 1) 
7 P(D1) n—-1 Zro% 


wI N 
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The answer brings to mind the Monty Hall problem {100}. First grab two marbles 
blindfolded and then apply n = 2 on those two. After the first marble is black, we 
know there are one or two black marbles, but not zero. The first one could have been 


2 
black in BW or either one could have been black in BB. Therefore, P(B, B2) = T 


Quotient Estimates | 


Solution 1 to Question 1 
Faryad D. Sahneh 


Suppose X is uniformly randomly chosen in [0,1] and Y is uniformly randomly chosen 
Y 
in [X,1]. The ratio r = Xx is a random variable with the cumulative distribution 


function 


P(r > k) =P(Y > kX) = pew >kX|X =2x)dP(X = 7). 


1 
Since Y is drawn from [X, 1], P(Y > kX|X = 2) =1- lk 1) forl<k< = 
—@ £ 
and 0 for k > a 
T 


1 
Therefore, for a fixed k > 1, the interval of the integration above is 0 < x < P 


and dP(X = x) = dz: 
P 


Pe >k) = PY > kx) = f 1- (k — 1)de 
0 -E 


Solution 2 to Question 1 
Bogdan Lataianu 


Given z, 


1 


P(Y > kX) = Ply € (kx, 1)) = / dy 


ky L-z 


_1l-ke 


1 
0 < =. 
l= 7 k 


Hence, in the general case, 


PY > kX) = | es 
0 1l-z 
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Solution 3 to Question 1 
N.N. Taleb 


We can reduce this case to the distribution of 


X Cave). Q-XY+X 
w := max ; = 
(l-X)Y +X X X 
where both variables are uniform in [0,1], with 
Ww 1 

fu(w) = log (5) ——. 

w-—1 w 

Then 

aa w 1 
P(r > k) = I log (4) — — dw = (1—k)(log(k — 1) — log(k)). 

k w—1 wW 


Solution to Question 2 
Bogdan Lataianu 


With reference to Figure [7.4] {(zx,y):0<xz<y<1}= AABC. 


A D 
Figure 7.4: Quotient estimate 1. 


In A ABC, y > a; hence, always r > 1. It follows that, for k < 1, the probability 
P(r > k) = 1. For the case k > 1, having a joint uniform distribution means that the 
probability of (X,Y) falling into a given subset of A ABC is proportional to the area 


1 
of the subset. Now, for a fixed k > 1, r > kin A ABF, where F = (G 1), It follows 


that P(r > k) = Se =r (Here [F] denotes the area of shape F.) 


Solution 1 to Question 3 
A. Bogomolny 


X and Y being uniformly distributed on [0,1] means that their probability densities 
are defined by fx(x) = 1, x € [0,1], and 0 elsewhere. Similarly, fy (y) = 1, y € [0,1], 
and 0 elsewhere. Being the two are independent, their joint density is the product 
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of the individual two: fx y(#,y) = fx(x)fy(y). It follows that fx,y (x,y) = 1 for 
(x,y) € [0,1] x [0, 1] and 0 elsewhere. In other words, the joint distribution is uniform, 


and the reasoning used to answer question 1 applies here as well. 


Y X 
With reference to Figure for a given k > 1, x > k in AABF and y >k 
in A ADE. 


A 


Figure 7.5: Quotient estimate 2. 


It follows that 4 (x,y): maari = AABF U AADE. As a consequence, 
min{ X,Y} 
for k > 1, 
1 j 1 
[A ABF] + [A ADE] 3% T3 1 
> = = = è 
P(r = k) [A ABCD] 1 k 


For k < 1, the corresponding probability is always 1. 


Solution 2 to Question 3 


N.N. Taleb 
X,Y 
r= eee LT can be rewritten as 
min{ X,Y} 


a Y x 
r = max Yx lixyy<uy + Ux/y2uy- 


Yet the distribution of w, a ratio of independent uniform variables, in [0, 1] is 


1 
fu(w) = 1 f 


1 
Normalizing for values > 1 since r is necessarily > 1, we end up with f,.(r) = — and 
r 


=q 1 
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Quotient Estimates II 


Solution 1 


Let U = min {X;} and V = max {X;}. The question requests that we find P(V > 


kU), k>1. 

To solve the problem we have to integrate the joint distribution fy,y (u,v) over the 
shaded triangle (Figure[7.6). The triangle is a part of the region {(u,v) : u < v}. Thus, 
we only need to find fy,v(u,v) for u < v. 


A 


Figure 7.6: Quotient estimate 3. 


Let f(x) be the uniform density function on [0,1]. For all i = 1,...,n, fx,(x) = 


f(x). Let F(x) be the uniform distribution function F(x) = / f(€)dé: 


0, «<0, 
F(x)=4z, 0<2<1, 
1, #21. 


For all i =1,...,n, Fx,(x) = F(x). Now, since X4,..., Xn are independent, 
Fy(v) = P(V < v) = P(X < v)P(X2 < v) P(Xn < v) 
= Fy, WW) Fxg (v): Fx, (v) = F” (v) 


so that 
0, Uv + 
Fy(v) = 4v, O<v<il, 
i; 1 


For the cumulative distribution function, 
Fy y(u, v) = PU <u, V <v) 
= P(U < u, Xı <v, Xə < v,..., Xn < v). 
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As a reminder, U > u, if and only if X; > u for alli = 1,...,n. U < u means the 
opposite, i.e., that X; < u for some X;. Therefore, we may continue (with u > v): 


Fy y(u,v) = PU <u,V <v) 


= P(U <u,X1 <v, X2 < v,..., Xn < v) 


= P(X1 < v, X2 < v,..., Xn < v) — [[ PU < Xi < v) 
i=1 


= F” (v) — P(u < Xı < v)P(u < Xə < v) -- Plu< Xn < v) 
= F" (v) — (F (v) — F (u))” 


Solution 2 
Amit Itagi 


Let x and y be the smallest and largest of the chosen numbers. These numbers are 
shown on the x and y axes in Figure [7.7] The red region is not feasible as y > x. The 
region where the ratio is less than k is the green region. 
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-~ 
kad 

—_ 
~ 


y (Largest number) 


(0,0) x (Smallest number) (1,0) 


Figure 7.7: Quotient estimate 4. 


The probability density of the chosen set of points being in a window dady around 
(x,y) is 
f(x,y)dady = n(n — 1)(y — 2)"-2dedy. 
The (y — x)"~? is the probability of choosing the (n — 2) points between the two 


fixed endpoints. The n(n — 1) comes from choosing the endpoints. 
The required probability is one minus the integral of this density over the green 


region and is given by 


1 y 1 y 
P=1- J (x, y)drdy = 1 — J J n(n — 1)(y — x)" *daxdy 
y=0 J r=% y=0 Jx 


k 


iB 1 n-1 
=): -f ny”! (1 — =) dy 
y=0 | k 


Solution 3 

N.N. Taleb 

We consider 

XxX, 
ae i A Xi 
= ——— > max ae 

min {X;} 0<i¢j<n Xj 
l<i<n 


Xx, 
We know the distribution of w = — and can arrange m permutations, with m = 
j 
! 
u-n If we assume that n — 1 ratios are independent, with probability distribution 
n — 2)! 
1 
function f,,(w) = 32 and cumulative distribution function F,,(w) = Ba then 
w w 


1 n-1 
P(max{w1,w2,...,Wm} >k)=1-— (1-5) : RSL, 
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The Lost Boarding Pass 


Solution 1 
102| pp. 35-37] 


When the last passenger boards the plane, there are just two possibilities: the one 
remaining seat may be his assigned seat or that of the first passenger. Under the 
assumption that no preference has been exhibited by the boarding passengers towards 
either of the two seats, they each have the same probability to become the last unoc- 
cupied seat: 50%. 


Solution 2 
21| p. 176], B. Bollobas credits Oliver Riordan with the proof. 


Suppose there are n > 2 passengers and n seats. If, for any k < n, the first k passengers 
occupy the seats assigned to them (as a group, not individually), then from then on 
everybody, including the last passenger, sits in his own seat. Also, if one of the first 
k < n passengers occupies the seat of the last passenger then the last passenger will 
not sit in his own seat. Now, if, for k — 1, neither of these events holds, then the kt? 
passenger’s choice has exactly as much chance of leading to the first event as to the 
second. If his own seat is unoccupied, neither of the events will happen. If his own seat 
seat is occupied, then there is exactly one (unoccupied) seat, the first, that leads to 
the first event and exactly one (unoccupied) seat, the last, that leads to the second. 


Solution 3 
Sergej Saletic 


1 
There is T00 chance the first passenger sits in his own seat, ensuring the last passenger’s 
1 
seat will be available. The first passenger has a T00 chance to sit in the last passenger’s 


98 
seat, ensuring it will be occupied. Finally, the first passenger has a T00 chance to sit 


in someone else’s seat and, whenever that happens, the first “middle” passenger who 


1 
takes either the first or last passenger’s seat will do so with a probability of 3 for each. 


Therefore, 
1 98 1 1 1 
è 1 5 . = 
100 r 100 2 = 100 2 
Solution 4 
Giorgos Papadopulos 
Define: 


A, as the event that the kt? passenger sits in his own seat. 
Im as the event that the first passenger takes the mt? seat. 
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Then, 
n-k+l1 1 k-2 n-k+1 
P(Ax) = n eigen n n—k+2 
n—-k+1 
ac ee ae k +2) + (k— 2)| 
2 toed 
= n=k+2 


For example, for n = 8, k = 6, m = 1...,8, 

1. I, Ig, Iz, Ig events are obvious. 

2. in event I5, P(A2) = P(A3) = P(A4) = 1, so the fifth person has only four seats 
available: P(Ag|I5) = : 

3. in event I4, P(A2) = P(A3) = 1, so we have two cases. 
In the first I4 case, the fourth person goes to seat #5 and the fifth person has 
only four seats available: P(Ag|I4) = 4 
In the second I4 case, the fourth person goes to a seat different than #5. Then 


P(As) = 1 and four seats are available: P(Ag|I,) = J 


4. in event Iz, P(A2) = 1, so we have three cases. 
In the first [3 case, the third person goes to seat #4 and then we proceed similarly 
to event I4: P(Ag|Is) = - 
In the second I; case, the third person goes to seat #5. Then P(A) = 1 and we 
proceed similarly to event I5: P(Ag6|I3) = 

In the third Ig case, the third person goes to a seat different than #4 or #5. 


Then P(A,) = P(As) = 1 and there are four seats available: P(Ag|I3) = . 


3 
5. in event I2, with similar reasoning as above, we have four cases and P(Ag|I2) = T 


Adding probabilities, we get 


8 
1 3 3 3 3 3 
Plo) = D> Pll) = 5 (1+3) 


and 
3 

P(A) =1——=-. 
(Ag) = 1-5 


Similarly, for k = 8, 


PUAN == a4 Oe: 
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Solution 5 
Jim Totten, Problem 324], considers the situation when the passengers board a plane—a 
bus actually—in numerical order of their seats. 


Let P,, be the probability that passenger pn will sit in seat sn. We will use mathematical 


1 
induction to show that Pa = 3 for n > 1. First, with n = 2, there are two equiprobable 


1 
events: pı, p2 sit in 51,82 or in 59,81, making Pz = 5 +0) = 5 
1 
Now, suppose n > 2 and that Pm = 5 for 1 < m < n. If passenger pı chooses 


sı, then passenger pn sits in Sn with probability 1. If pı chooses seat sn, then pn sits 
in Sn with probability 0. If pı chooses są with 1 < k < n, passenger p; sits in s; for 
1<1i<k, and passenger pp will have to choose from seats sı or s; withk+1<j <n. 

Since this is essentially the same situation as if passenger pp were pı with n—k+1 
passengers and seats, we conclude from the induction hypothesis that the probability 


1 
that passenger pn sits in seat Sn is 5 for each of the n — 2 choices of k, 1< k <n. 
Therefore, 


1+0+(n—-2)- 


P= = 
n 2 


NI = 
= 


Lucky Contest Winners 


Solution 1 
The simplest way to approach the riddle is to consider the associated sample space. 
There are two girls and two boys from which a pair of winners is going to be drawn. Let 
us denote the girls G and g, and the boys B and b. Formally speaking, there are 16 = 
4 x 4 possible ordered pairs formed by the four letters. However, from the context of 
the problem, the repetitions, like GG, make no sense and should be excluded outright. 
The first three questions (with the last coming first) are collected in the follow- 
ing tables: 


One of the winners First winner Second winner 
is a girl is a girl is a girl 

The legend is this: The impossible pairs (two boys or the same girl twice) are 
marked with a “x”. The two girl combinations are marked with “+” and all the others 
with “—”. In the first question (the middle table), the first member of the pair is known 
to be a girl, making the four combinations in the lower left corner impossible. These are 
also colored blue. A similar treatment applies to the upper right corner in the last table 
(second question). Of the remaining squares, the ones with favorable outcomes—Gg 
or gG—are colored yellow and the unfavorable ones are colored red. 
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For the first two questions there are six possible outcomes: two favorable and four 
unfavorable. In both cases then the probability that two girls have been selected to 
2 1 
advance to the national finals is gre 
For the third question, you were as likely to cast your eyes on the first girl as on 
the second. By the symmetry principle, the probabilities of the two events are equal to 


1 1 
z whereas each of the girls has an individual probability of 3 to be a winner. Thus, 


11 11 41 
the total probability of both girls is =-= + =-=- =-=. 
2. 8 2: BS 

For the fourth question, there are 10 possible outcomes with only two favorable. 


2 1 
The probability of having two lucky girls is then 0 7 5 


Solution 2 
Mike Shaughnessy 


If the girl that you saw on the stage was the first winner, then, out of the three students 
left for the random drawing for a second winner, only one is a girl and two are boys. 


The chances that the second winner is also a girl will be 1 in 3, or = 

However, if the girl you saw on stage was actually the second winner, then the 
chance that the first winner was also a girl is > because before the first girl was 
picked there were two girls and two boys, so the chance of choosing a girl was then 2 
in 4, or =. 


Similiarly, for question 2, if the girl you saw on the stage was the first winner, then, 
out of the three remaining students, one is a girl and two are boys, so the chances that 


1 
the second winner will also be a girl is 3" 


Unlike question 1, it does not make any difference whether you saw the first girl or 
the second girl; the fact that you saw a girl winning at all means that there is only a 
1 in 3 chance that the other winner is a girl. Therefore, the probability that the first 
winner was also a girl, if the girl that you saw on the stage is the second winner, is 

1 


also —. 


3 


Notes 


The solutions are followed by advice to run a simulation. Before you do, give a thought 
to the fact that the second part of the first solution does not exploit the fact that the 
student you saw as the second winner was a girl. The argument would be the same if 
you saw a boy. This should make one suspicious of this line of reasoning. 


Solution 3 


To strengthen the second solution, we may apply the Bayes theorem. In our notations, 
we would like to evaluate P(G|g)—the probability that the first winner was a girl 
provided the second winner was also a girl. We accept as known (obvious or from the 
argument above) that 
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1 
1. P(G) = P(B) = 5 i.e., the probabilities of the first winner being a girl or a boy 


1 
are both equal to 5 


1 
2. P(g|G) = 3? i.e., the probability of the second winner being a girl provided the 


1 
first one was a girl is 3 (solutions 1 and 2). 


2 
3. P(g|B) = 3? i.e., the probability of the second winner being a girl provided the 


2 
first one was a boy is z This is because, when the first boy was out after the 


first drawing, there were two girls and one boy remaining for the second drawing. 


Now, let us use Bayes’s formula: 


P(Gg) P(g|G)P(G) 
Plo) = BG) = P@IG)PIG) + PUIB)PB) 
11 1 
3 2 3 
“11 21 1 2 
33'323 "3 
1 
=>. 
Solution 4 


Looking back, it might have been easier to evaluate P(Gg) outright. This is the prob- 


4 
ability that the two winners were both girls. There are ( 3) = 6 cases to choose two 


1 

items out of four. We are only interested in one selection. Therefore, P(Gg) = A 
= ba d : 1 1 1 
Divide by P(g) which is 5 and we obtain P(G|g) = 6 + ga 


Diminishing Hopes 


7 2 1 
The answers to the three questions are 9° 3 and 3° respectively. The result may 


be surprising. Chances of locating the document decrease with every checked drawer. 
The general formula for the probability of finding the document after unsuccessfully 
8—n 
10-—n 
can check the solutions below. 


checking n drawers is Pp = . This is a decreasing function of n for n < 10. You 


Solution 1, Clever 
and, independently, Michael Wiener 


Let us add two imaginary drawers to the desk and assume that this is where the 
documents get lost. We may even think of them as being locked during the search. 
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With k drawers to go, of which two are unavailable, the probability of a successful 
k—2 — 
` With k = 10 — n, this reduces to P, = = " | This is a decreasing 
-n 
function of n for n < 10. 
It is also curious that, if we only ask about the probability of finding the document 
in the next available drawer (after checking the first n drawers), the probability will 


be Qn = 


outcome is 


, an increasing function of n for n < 10. 
n 


10 — 


Solution 2, Counting Odds 
Joshua B. Miller 


Not finding the document after k checks is 3 2 7 times more likely if it is not in the 


—k 
desk (not found with probability 1) versus in desk (not found with probability =="). 


Prior odds it is not in the desk are 1 : 4, so posterior odds it is not in the desk are 
E : 4, i.e., 2 : (8 — k). Therefore, odds in favor are (8 — k) : 2, i.e., (1) 7 : 2, (2) 
1 


7 4 
4:2, (3) 1 : 2, which translates into the probabilities (1) p (2) A (3) 3° 
Solution 3, Conditional Probabilities 
Amit Itagi 


Let us denote the events “document is lost” by L and “document is in drawer 7” by D;. 


1 4 1 
Let the unconditional probability of D; be q; q = 3 57 T0 The overbar designates 
a complementary event. Then 


1. PD) = = = = 


2. P|, D) == ane - OA) = 


7 DIT 
3. PT nZ D;) = Pier Pit) _ (1/8) (8q) _ L, 
P(Us_, Di) 1—7q 3 


Solution 4, Bayesian Odds 
A. Bogomolny 


We assume that the search for a document is thorough enough to ensure that a docu- 
ment that was stored in the desk will be necessarily found. 

Let D denote the event that the sought document is in one of the drawers; D that 
4 
5 > 
the first k drawers. Then, computing the Bayesian odds, 


= 1 
it is not: P(D) = =, P(D) = zi Let K be the event of not finding the document in 
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P(D 1 = 

om aie P(K|D) = 1 and P(K|D) is the probability of the document not 

being found in the first k drawers, even though it is somewhere in the desk. This exactly 
—k 

means that the document is in the 8 — k remaining drawers, making P(K|D) = -n 


Thus, 


where 


PDK) 8 1 22 
P(D|K) 8-k 4 8-k 


= —k 
Since P(D|K) + P(D|K) = 1, the above resolves into P(D|K) = os 


This gives the three answers above. 


Solution 5, Bayes’s Theorem 
A. Bogomolny 


In the notations of solution 3, 


P(D|K)a = P(K|D) - P(D) D 8 
-~ P(K|D)- P(D)+ P(K|D): P(D) 8-k 


Incidence of Breast Cancer 


Introduce the events: 
e P—mammogram result is positive, 
e B—tumor is benign, 
e M—tumor is malignant. 


Bayes’s formula in this case is 


P(P|M)P(M) 
(P|M)P(M) + P(P|B)P(B) 


P(M|P) == 


B .80 x .01 
~ 80x .01 +.10 x .99 
zx 0.075 


= 7.5%. 


The result is a far cry from a common estimate of 75%. 
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A Search for Heads and Its Consequences 
With assistance from Joshua B. Miller, Alejandro Rodriguez, Thamizh Kudimagan and 
Zhuo Xi. 


Solution to Question 1 


There are four possible outcomes for the first two flips: TT, TH, HT, HH. Since Jack 
selects a flip immediately preceded by a heads, the first two sequences could not have 
led to his selection of the second flip. Two possibilities remain: HT and HH, which 
are equiprobable. However, in the second case, Jack had a choice between selecting the 
second or the third flips. Thus, the probability of selecting heads is split. Hence, the 


1 
answer is z 


Solution to Question 2 


Since the third flip was selected, the second one is bound to be heads. Thus, we have 
four possibilities: THT, THH, HHT, HHH. They are all equiprobable, so that the 


; ; ; 1 
answer in this case is also z 


Solution 1 to Question 3 


We do not know what Jack’s selection was, but we do know that heads came up 
in the first flip or the second (or both). Thus, we have six possible sequences (i.e., 
excluding TTT and TTH): HTT, HTH, THT, THH, HHT, HHH. Each of these 


1 
has the probability of 6 of occurring. In the first four Jack has no choice and is bound 
to selecting a unique one. His election is a heads in just one case, THH, with the 


1 
probability of 5 Of the remaining two cases, where Jack has a choice to make, heads 


would be selected in three out of four possibilities: HHT, HHT, HHH, HHH. Thus, 


1 1 
the probability of a heads flip being selected is JŠ : =y 
Combining the two cases, heads would be selected with the probability of 
1 i 1 5 
6 4 1? 


Solution 2 to Question 3 

A shorter way is to combine the answers to the first two questions, since it is equally 
1 1 1 1 5 

22 23T TE 


likely to be the first or second flip: 


A Three Group Split 


Let us represent the three group split by a sequence of three digits, each of which 
standing for the amount of even numbers in a group. The order of the groups is of no 
consequence. The starting configuration is 000. There are nine open slots to place the 
first even number in. However it is done, the next state is 100. 
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With one even number in place, there are eight slots—two in the same group as 
the first even number. Placing the second even number there moves to the state 200; 


6 
placing it elsewhere, with probability 3? will move to the state 110. We continue in 


this manner, distributing even numbers, of which there are four. 

When all is said and done, there are only three possible states with four even 
numbers: 211 (meaning no group has three odd numbers), 220 and 310. In each of the 
latter two cases, there is one group (corresponding to 0) in which all numbers are odd. 

The probability of getting into the state 220 is 


6 4 2 6\ 2 
G eee a s) 6 
24+12 1 3 
56 3 M 
The probability of getting into the state 310 is 


P(220) 


2/16 61 6 4 1 
P(310)= 2. (294 ho H= 
(sn) ==: (5-5 DERE 

1 6+6 1 2 

4 42 4 14 


2 
Thus, the probability of having a group of odd numbers is I4 + = = 2 
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Lewis Carroll’s Pillow Problem 


According to Martin Gardner: 


In 1893 Carroll published a little book of seventy-two original mathematical 
puzzles. ... The most interesting puzzles in Pillow-Problems concern prob- 
ability. The first one, problem 5, is simple to state but extremely confusing 
to analyze correctly. 


To help the reader see the difficulty and perhaps make the presence of the confusion 
clear, Carroll gave two solutions. 


Solution 1 
Lewis Carroll 


As the state of the bag after the operation is necessarily identical with its state before 


1 
it, the chance is just what it was, viz. z 


Solution 2 

Lewis Carroll, Martin Gardner 

Let B and W; stand for the black or white counter that may be in the bag at the start 
and W2 for the added white counter. After removing a white counter there are three 
equally likely states: 


Inside Bag | Outside Bag 
Wi W2 
W2 Wi 
B Wo 


In two of these states, a white counter remains in the bag, so the chance of drawing 


2 
a white counter the second time is 3 


Notes 
Some time after putting the problem on the web I (AB) received a letter from 
Chris Conradi: 

Carroll’s Pillow Problem says that we know a bag contains one counter, and it is 
either black or white. The solution presumes that the bag is equally likely to contain 
a black counter or a white counter, although neither Carroll nor Bogomolny makes 
that clear. It would be helpful to make that point in the statement of the problem. 
Otherwise, the answer cannot be determined from the statement of the problem. 

This is just a particular instance of a more general problem. Suppose that the first 
counter is drawn at random from another bag containing d counters, n of which are 
white and d — n of which are black. Without revealing its color, this first counter 
is placed in a second, empty bag. Then a white counter is added to this bag. Now 
you draw a counter from this second bag and it turns out to be white. What is the 


n 
probability that the remaining counter is white? The general answer is TI i.e., 
n 
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2 
n = 1 and d = 2. The answer is 3? as given, but we can also determine the probability 


ifn =1and d=4(2), ifn =3.andd=4($) or ifn =5 and d= 18( 5). 


A Follow Up on Lewis Carroll’s Pillow Problem 


Solution 1, Correct and Short 
Joshua B. Miller 


Prior odds in favor of two white counters in the second bag are w : b, but drawing 
w is twice as likely under ww as under bw, so posterior odds in favor are 2w : b, i.e., 
2w 


Prob(ww|drawn w) = Foe 
w 


Solution 2, Correct and Long 


Introduce P(W) = rat probability of the counter removed from the first bag being 
w 


b 
white. For the black counter, it is P(W) = ar P(W|W) and P(B|W) are the 
w 


probabilities that the remaining counter in the second bag is white and black, respec- 
tively; P(W|W) and P(W|B) are the probabilities of removing of a white counter 
from the second bag, provided that it contains two white counters or a black and a 
white counter. 


P P P 
The Bayesian odds are expressed as eae = ene . ae . It is obvious 


P(WIW) .. 
that r~ = 2, impl 
a P(WIB) » Implying 
P(W|W) | 9. YL 2w 
P(BIW) “b è 
2 
Since P(W|W) + P(B|W) = 1, we derive P(W|W) = 5 =; 
w 


Solution 3, Correct and Long 


We will use Bayes’s theorem. The notations are from the previous solution: 


P(W|W)- P(W) 
P(W|W) - P(W) + P(W|B) - P(B) 


P(W|W) = 
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Solution 4, Incorrect 


There is a 50% chance that the remaining counter is from the original bag, so P = 


FF There is another 50% chance that the remaining counter is the added white one. 


w 
Together, these have a proability of 1. Therefore, the sought probability is 0.5 + 
w 
2w +b 
0.5, which simplifies to P = TE 


How to See That Solution 4 Is Incorrect 


Assume that there were no white counters in the first bag: w = 0. Then the probability 


1 
of the leftover counter being white is zero. However, for w = 0, the formula produces z 


Sick Child and Doctor 


This is a direct application of Bayes’s theorem: 


P(R|M)P(M) 
(P(R|M)P(M) + P(R|F)P(F)) 
B .95 x .10 
~ 95 x .10 + .08 x .90 
zx 0.57. 


P(M|R) = 


The result is nowhere close to 95% of P(R|M). 


Right Strategy for a Weaker Player 


Solution 1 


The strategy is to play a daring game when tied or losing and a conservative game 
when ahead in the match. 

After two games, there are four possible outcomes (from A’s perspective, with D, 
W, L standing for Draw, Win, Lose): 


Result Conclusion | Probability 
LL match lost 0.55 - 0.55 = 0.3025 
LW tiebreaker required | 0.55 - 0.45 = 0.2475 
WL tiebreaker required | 0.45 - 0.10 = 0.0450 
WD match won 0.45 - 0.90 = 0.4050 


The tiebreaker happens with the probability of 0.2925, so A is going to play a 
daring game, winning with the probability of 0.45. This makes the total probability of 
his winning the match equal to 


0.4050 + 0.45 - 0.2925 = 0.536625 ~ 54%. 
Solution 2 


N.N. Taleb 


The approach is to use a simplified form of stochastic dynamic programming, given 
that we are initially facing a 2 period decision tree. 
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We have two regimes indexed by 1 (aggressive), 2 (conservative) and their corre- 
sponding probabilities: 


Pw, Pwa 
Pi = | pa Pa = | Pa, 
Ph Ply 


We also have a payoff vector per period for win, draw or lose of À = (1,0,—1). 
We look at the expectation at every step. Note that for the dot product P,.(1 +A) 
can have for notation P,.(1 + AT). 


Pw, -Pi.(1 + A) (aggressive, win, then aggressive) 
Pw, -P2.(1 + A) (aggressive, win, then conservative) 


a,-P,.(A) (aggressive, draw, then aggressive) 
dı -P2.(A) (aggressive, draw, then conservative) 


pi, -P,.(—1 + A) (aggressive, loss, then aggressive) 
pi,-P2.(—1 + A) (aggressive, loss, then conservative) 
max 
Pwo-P1.(1 + A) (conservative, win, then aggressive) 
Pws-P2.(1 + à) (conservative, win, then conservative) 


Ddy-P.(A) 
Pde .P2.(X) 


Pi, -Pi.(—1 = A) 
Pia -P2.(—1 aa à) 


Chickens in Boxes 


Solution 1 


All in all, there are 16 equiprobable ways to place chickens into four boxes. Only three 
of them satisfy the conditions of the problem: 


Box 1 2 3 4 
chicken chicken 
chicken chicken 
chicken 


Thinking of the three possibilities as equiprobable, the corner box 3 contains a 
chicken only once; hence, the chance of this happening is 1 : 2. The same is true of 
boxes 1 and 2. Box 4, on the other hand, contains a chicken twice; hence, it contains 
a chikcen with the chances of 2 : 1. 
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Solution 2 


Probability that the corner box contains a chicken, provided there is only one chicken, 
P(B3|1 chicken) = 1; in the case of two chickens, P(B3|2 chickens) = 0. The first 
event occurs in one case (see the table above); the second event occurs in two cases. 
Therefore, the chances that the corner box contains a chicken are | : 2. 

Similarly, for box 4, P(B4|1 chicken) = 0; P(B4|2 chickens) = 1. The first event 
occurs in one case, the second in two. Hence, for box 4, the chances are 2: 1. 


1 
By the same reasoning, P(B,|1 chicken) = 0; P(B,|2 chickens) = a Therefore, the 


1 
chances for Bı (and, similarly, Bz) are 5° Ik shee. 


Two Chickens in Boxes 


Solution 1 
As in the previous riddle, there are 16 equiprobable chicken placements, of which five 
satisfy the conditions of the problem: 


Box 1 2 3 4 
chicken chicken 
chicken chicken 


chicken | chicken 


chicken chicken 


chicken | chicken 


2 
Thus, boxes 1 and 2 contain a chicken with the probability 5} for boxes 3 and 4, 


the probability is 3 


Solution 2 
Joshua B. Miller 


Chicken placements CCaz, CaCx, CaaC, «CCa, xCaC, xxCC are equally likely. It 
follows that prior odds in box 3 are 1: 1, 


Pr(“yes”|in box 3) = 1, 


2 
Pr(“yes”|not in box 3) = 3 
: 3 ; 
and posterior odds are 3° 1, or 3: 2, i.e., 


Pr(in box 3) = . 
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Two Chickens in Bigger Boxes 


Solution 1 


As in the earlier riddles, there are 16 equiprobable chicken placements, of which five 
satisfy the conditions of the problem: 


Box 1 2 3 4 
chicken chicken 
chicken chicken 


chicken chicken 


chicken chicken 


chicken chicken 


2 chickens 


1 
Thus, boxes 1 and 2 contain a chicken with the probability 3} for box 3, the 


2 1 
probability is =; for box 4, it is 3 Su 
3 6 2 


Solution 2 


Let us follow where the chickens go. Each has equal probabilities of getting into any 
of the four boxes. Some configurations are OK, while some violate the constraints of 
the riddle: 


chick 1 in box / chick 2 in box | 1 2 3 4 | 
1 X | X | OK!| OK | 
2 X | X | OK!| OK | 
3 OK! | OK! | OK! | OK! | 
4 OK | OK | OK! | X | 
There are 11 legitimate, equiprobable situations: in seven of them, the corner box 
is not empty; in six, box 4 is not empty; in four, boxes 1 and 2 are each not empty. 


4 4 7 6 
Thus, the probabilities for the four boxes not to be empty come out as IP IT IT IE 


Notes 
Why are there two solutions and which is correct, as the two cannot be correct at the 
same time? The difference between the two is in that the first one treats the chickens 
as indistinguishable while the second presumes that they are different: chicken 1 and 
chicken 2. 

I do not believe that the problem is suggestive of the idea of two different chickens, 
although solution 2 has been posted on Twitter several times. I included it here to give 
a reason to remark on that topic. 


Chapter 8 


Expectation 


If you hear a “prominent” economist using the word ‘equilibrium,’ or 
‘normal distribution,’ do not argue with him; just ignore him, or try to 
put a rat down his shirt. 


Nassim Nicholas Taleb, The Black Swan: The Impact of the 
Highly Improbable 


What Is Expectation? 

In a sample space of equiprobable outcomes, the probability of an event is the ratio 
of the number of favorable outcomes to the total size of the space. This means that 
the probabilities of events are defined in relation to each other. If there is a finite 
number of exhaustive (i.e., at least one always happens) and mutually exclusive (i.e., 
no two happen simultaneously) events Ak, k = 1,2,...,K, with nę being the number 
of favorable outcomes in Ax, then 


Nk 
N ? 
where N = nı +#no +: +ng. 

Borrowing an example from the classic text by W. Feller [88], in a certain popu- 
lation, ny, is the number of families with k children. N is then the total number of 
families and, assuming the most benign of circumstances, there are 2N adults. How 
many children are there? To state the obvious, each of the nọ families with k kids has 
k kids, so the number of kids in such families is kng. The total number of kids is the 
sum of such products over all the various family sizes: 


P(A) = 


T=1-n,+2-not---+k- -ng+ +K- nk, 


where K is the number of children in the largest family. On average, every family has 


T 
E= = kids: 
pul 
ON 
Spe ae ee LR E 


N N N 


K 
=J kpr 
k=0 


Tk 


where pk = N is the probability for a family to have k children. Expectation is one 


of the most important notions in the theory of probabilities. We shall give a more 
general definition. 
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Let X be a random variable that takes values xz, k = 1,2,...,K, with the proba- 
bility pk: P(X = xz) = pp. The sum 


K 
= X TkPk 
k=1 


is known as the mathematical expectation of X (and often the expected value, the mean 
or simply the average). 

The main feature of the expectation is its linearity: for two random variables X 
and Y, and two real numbers a and b, 


E(aX + bY) = aE(X) + bE(X). 


This is actually a direct consequence of the additive definition. 


Riddles 


8.1 Randomly Placed Letters in Envelopes 


Problem 104] 


N letters are randomly placed into N envelopes, one per envelope. What is the expected 
number of letters which get into the correct envelopes? 


8.2 Expected Number of Fixed Points 


The previous riddle of misplaced letters/envelopes admits a more formal formula- 
tion Chapter 5]: 


Let N,, = {1,2...,n}. Consider a permutation p of N,. A point k is fired 
by p if p(k) = k. Permutations with no fixed points are known as derange- 


ments. 


Considering all possible permutations, what is the expected number of fixed points? 


8.3 An Average Number of Runs in Coin Tosses 


A coin is tossed several times and the outcomes are recorded in a string of H (heads) 


and T (tails). For example 


DG 


is recorded as HHTTTHT. Now, a run is any part of such a record that consists of 
a maximal number of contiguous equal outcomes. In the example, we have four runs: 
HH, TTT, H, T. Their lengths are 2, 3, 1, 1, respectively. Now, here is the question: 


A coin is tossed N times. What is the expected value (average) of the 
number of runs? 
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8.4 How Long Will lt Last? 
Problem 98] 


In a coin tossing game, you win if our tosses match; I win if our tosses differ. The 
winner collects both coins. You start with m coins; I start with n coins. 
How long, on average, will the game last until one of us is wiped out? 


8.5 Expectation of Interval Length on Circle 
Volume 2, Chapter 1] 


Two points, X and Y are uniformly distributed on a circle of circumference 1. 
e What is the expected arc length of interval |XY|? 


e Point P is fixed on the circle. What is the expected arc length of interval |XY| 
that contains point P? 


e What if there are three random points? 


e What if there are n random points? 


8.6 Waiting for a Train 
pp. 22-23] 


Every morning at a railroad station there is just one train arriving between 8:00 and 
9:00 and another arriving between 9:00 and 10:00. The arrival times and their proba- 
bilities are shown in the following table: 


| Train A 8:10 | 8:30 | 8:50 
| Train B 9:10 | 9:30 | 9:50 
1 1 1 


A traveler arrives at 8:20. What is the expectation of his waiting time? 


8.7 Average Number of Runs in a Sequence of Random Numbers 
pp. 29-33] 


M numbers are randomly selected from the set {1,...,N}. A run is either a decreasing 
or an increasing subsequence of maximum stretch, i.e., “subruns” do not count as runs. 

Assuming no number was selected more than once in succession, what is the average 
number of runs? 


8.8 Training Bicyclists on a Mountain Road 
Problem 65352| 


In the movie GoldenEye, racing James Bond startles a group of bicyclists so that they 
fall over in a chain reaction. Relaxing the story, assume each of the n bicyclists in a 
line tumbles with the probability of p. Once one of them tumbles, all following him 
tumble too. 

What is the expected number of bicyclists who fall on the road? 
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8.9 Number of Trials to First Success 


Let V be the event that occurs in a trial with probability p. Is the mathematical 


1 
expectation of the number of trials to the first occurrence of V in a sequence of trials —? 
P 


8.10 Waiting for an Ace 
Problem 40] 


After a thorough shuffle of a deck of 52 cards, you deal cards from the top of the deck. 
How long—on average—will you have to wait for an ace? 


8.11 Two in a Row 
Problem P1983-2| 


Suppose you repeatedly toss a fair coin until you get two heads in a row. What is the 
probability that you stop on toss number 10? 


8.12 Waiting for Multiple Heads 


Flipping a coin, heads comes up with the probability of p, 0 < p< 1. 
e What is the expected number of flips before getting three heads for the first time? 


e What is the expected number of flips before getting four heads for the first time? 


8.13 Probability of Consecutive Heads 
Josh Jordan 


A biased coin takes 100 flips on average to get 10 heads in a row. What is the bias? 


8.14 Two Dice Repetition 


Problem 409] 


There are two fair dice, one marked as usual with 1, 2, 3, 4, 5, 6, the other with 
B,I,N,G,O,*. Both dice are tossed on a move. What is the expected number of the 
move at which the first repetition occurs in each of these cases: 


1. All 36 combinations (1B through 6*) are considered to be different (and 
equally likely)? 


2. As before, but all combinations 1*, 2*, 3x, 4*, 5*,6* are considered the same? 


8.15 Expected Number of Happy Passengers 
This is a follow-up to Riddle [7.8] on page the current riddle was suggested to me by 


Konstantin Knop and to him by Alexander Pipersky. 


On a sold out flight, 100 people line up to board the plane. The first passenger in the 
line has lost his boarding pass, but is allowed in, regardless. He takes a random seat. 
Each subsequent passenger takes his or her assigned seat if available, or a random, 
unoccupied seat, otherwise. 

A passenger is happy if he/she occupies his/her own seat, unhappy otherwise. What 
is the expected number of happy passengers? 
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8.16 Expectation of the Largest Number 


n balls are drawn, without replacement, from an urn containing N > n balls, numbered 
l to N. 
What is the expected value of the largest number drawn? 


8.17 Waiting for a Larger Number 
pp. 31-32] 


In what follows all numbers are uniformly distributed on [0,1]. Start with a number 
So and then keep generating the sequence S1, S2,... until, for some N, Sy > So for 
the first time. 

What is the expected value E(N) of that N? 


8.18 Waiting to Exceed 1 

pp. 31-32] 

In what follows all numbers are uniformly distributed on [0,1]. Start with a number 
N 


So and then keep generating the sequence S1, S2,... until, for some N, >» Sk > 1 for 
k=0 


the first time. 
What is the expected value E(N) of that N? 


8.19 Waiting for All Six Outcomes 
102| pp. 35-37] 


On average, how many times do you need to roll a die before all six different numbers 
show up? 


8.20 Walking Randomly—How Far? 
Problem 115] 


A fair coin is flipped repeatedly. Starting from « = 0, each time the coin comes up 
heads, 1 is added to x, and each time the coin comes up tails, 1 is subtracted from z. 
Let an be the expected value of |x| after n flips of the coin. 

Does an > œ as n > 00? 


8.21 Expectation of Pairings 
Problem 22] 


Suppose that eight boys and 12 girls line up in a row. Let S be the number of places 
in the row where a boy and a girl are standing next to each other. For example, for 
the row 


GBBGGGBGBGGGBGBGGBGB, 


we have S = 13. 
Find the average value of S' (if all possible orders of these 20 people are considered). 


196 Cut the Knot: Probability Riddles 


8.22 Making Spaghetti Loops in the Kitchen 
This riddle relates to Riddles and In pp. 3, 7] P. Winkler also refers to Martin 


Gardner’s [42]. 


The 100 ends of 50 strands of cooked spaghetti are paired at random and tied together. 
How many pasta loops should you expect to result from this process, on average? 


8.23 Repeating Suit When Dealing Cards 
Exercise 9.10.13] 


Consider a standard deck of 52 cards, thoroughly shuffled, making all card orderings 
equally likely. We pick cards one at a time until we get two cards of the same suit. 
What is the expected number of cards drawn before two cards of the same suit are 
drawn? Consider two cases: 


1. Drawing cards with replacement. 
2. Drawing cards without replacement. 


Before running the calculations, guess which is smaller. 


8.24 Family Size 


A young couple plans to start a family. They consider five different strategies. What 
is the expected family size 


1. if they stop having children after having one boy? 
2. if they stop having children after having one girl? 
if they stop after having children of both sexes? 


if they stop after they have a child of the same sex as the first one? 


pr aa ig 


if they stop after they have a child of the sex different from that of their first child? 


8.25 Averages of Terms in Increasing Sequence 
B-425] 


Let k and n be positive integers with k < n and let S consist of all k-tuples X = 
(£1, £2,..., £k), with each z; an integer and 1 < £1 < £2 <... < Ek S N. 
For j = 1,2,...,k, find the average value 7; of xj over all X in S. 
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Solutions 


Randomly Placed Letters in Envelopes 


1 
Each letter has the probability of N to get into the right envelope. If E is the event 


of the k*t! letter getting into the right envelope, 


E k if the k* letter gets into the right envelope 
k = 


0, otherwise. 


-1 
N 
Although the events are not independent, the expectation being linear, we still have 


N N 1 
E (Za) = 5° E(Ex) =N- yal 
k=1 k=1 


Thus, for all N, the expected number of correct fits is one. For N = 2, either both 
letters get into the right envelopes, or none does. Both events have the probability of 


1 N 1 
The expectation of Ep is E(E,) =1- N +0- = for all k. 


1 
z with the expectation of 


E(E, U E2) =0- 


Expected Number of Fixed Points 


Solution 1 
Chapter 5] 


The expected number in question, which we shall denote E(N), equals 
E(n) = 5 kar, 
k=0 


where gq, is the probability of there being exactly k fixed points. The number of per- 
mutations with exactly k fixed points is the number of ways of choosing k points out 
of n times the number of derangements of the remaining n — k points. It follows that 


where !m is the subfactorial of m, the number of derangements of a set of m ele- 
ments. Thus, with a substitution m = n — k and using the symmetry of the binomial 
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; n n 
coefficients, ( ) = ( i; we get 
n-m n 


I 
Me 
TD 
| 
= 
fo ON 
3 3 
5 
I 
M= 
D 
a 
| 
= 
5 


7 m!(n —m— 1)! 


n 
n—1 
Still, it is easily seen that 5 ( sm = (n — 1)! because every permutation 
m 
mal 
has a certain number of fixed points with the remaining points deranged. (As above, 
-1 -1 
but with n— 1 points, there are ( n ) Im = (" ) -Im permutations with m 
n m 


—m-1 
fixed points.) Thus, from (n — 1)!- E(n) = (n—1)!, we get E(n) = 1, independent of n. 


Solution 2 
Roland van Gaalen 


Use the indicator function I(k) = 1 if there is a fixed point at k and 0 otherwise. 


1 
EI (k)] = for each k. By the additivity of the expected value, 


Solution 3 
Amit Itagi 


Of n! permutations, (n — 1)! fix a particular point. Thus, the total number of fixed 


points over all permutations is n x (n — 1)!. It follows that the expected number is 
nx(n=1)! _ i 


n! 
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An Average Number of Runs in Coin Tosses 
Eli Bogomolny 


Instead of counting the number of runs, we will count the number of breaks between 
the runs, i.e., the number of pairs HT or TH. Obviously, the number of breaks is one 
less than the number of runs. 

As has been noted, a break occurs when two successive tosses have different out- 
comes. All in all, two coin tosses may have four outcomes: the two tosses are different 
in two cases and equal also in two. Thus, the probability of a break occurring over a 


1 
particular pair of tosses is 5 


For N tosses of the coin, there are N — 1 successive pairs of tosses, each having 


1 1 
an expectation of 0 x 3 +1x 273 of constituting a break. The expectation being a 
f : . . N-I 
linear function, the expected number of breaks among N — 1 pairs is y 


N-1 N+1 
It follows that the average number of runs is 5 +1= TE, 


How Long Will It Last? 


Solution 1 

Let us denote K = m +n. There are two ways the game may end: the winner is either 
you or me. Conditioned on me being the winner, let L(x), 0 < x < K, be the expected 
length of the game if I have x coins. There is an obvious recurrence relation 


L(x) = ILe —1)+ iLE +1)+1, 


1 
as I can win or lose a coin with the same probability of 7 One is added to the 


account for the present toss. 
Thus, we have an inhomogeneous difference equation of order two: 
L(x +1) — 2L(x)+ L(x- 1)+2=0. 

The characteristic polynomial of the homogeneous equation has a double root 1 such 
that the general solution of the inhomogeneous equation is a quadratic polynomial, say 
L(x) = u + vz + wz?. Substituting that into the recurrence yields 2w = —2 so that 
w=-l. 

For the boundary conditions, we have two: L(0) = 0 and L(K) = 0. From the first 
one, u = 0, and from the second, v = K. Thus, the solution is L(x) = «(K — x). If I 
start with n coins, L(n) = n(K — n) = mn. 

Naturally, the expectation is symmetric in n and m. 


Solution 2 
p. 106] 


Let E(n,m) be the expected length of the game in the case I start with n and you 
start with m coins. On a toss, we will move to either E(n—1,m+1) or E(n+1,m-—1), 


1 
each with the probability of z which leads to a recurrence relation: 


E(n,m)=14 


1 
5 E(n 1jm+1)+ z£ + 1,m—1), 
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1 indicating the first toss. Of course this assumes n,m > 0. The boundary conditions 
are E(0,m) = 0 and E(n,0) = 0. 

If we view E(n,m) as the function of one variable, say, n along the line m+n = 
constant, then the formula says that the second difference is a constant (—2), and so 
E(n,m) is a quadratic function. Vanishing it at the endpoints forces this to be emn, 
and direct evaluation shows that c = 1, implying E(n,m) = mn. 


Solution 3 
Christopher D. Long 


Let E(k) be the expected value of the length of the game, if, say, I am in possession 
of k coins. Then 


E(k) = TER piate ii. 


It is convenient to reduce that to a difference equation: 


(E(k +1) — E(k)) — (E(k) — E(k — 1)) = -2. 


In terms of the difference operator A, A? E = —2, implying that E is a quadratic 
function: 
E(k) = —k(k—1)+ak +b 


with the boundary conditions £(0) = 0 and E(m + n) = 0. From the former, b = 0 
and, from the latter, a = m +n — 1 so that E(k) = —k(k — 1) + k(m +n — 1). At the 
beginning of the game, 


E(m) = —m(m — 1) + m* + mn -m = mn. 


Expectation of Interval Length on Circle 


Solution 1 


Fix a direction on the circle. If |XY| denotes the arc length from point X to point Y, 
then |XY|+|YX]| = 1 and, by symmetry, the two intervals have the same distribution. 


1 
Thus, the average length of either is z 
If there are three random points then, by the same token, the expected length of 
1 
any of the intervals is 3 The same holds for two random points dropped onto the 


segment [0, 1]. Dropping n points on [0, 1] is equivalent to dropping (n +1) points onto 
the circle. This means that if one point is fixed on the circle, it falls into an interval 
which is the union of the two intervals with that point as one of the ends. It follows 
that P belongs to the interval whose average length is twice that of other intervals. 


1 
Thus, if there are n random points, the average arc length of an interval is —. With 
n 


n random points and one fixed, the expected length of the interval that contains the 


2 
fixed point is ; 
n+1 
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Solution 2 
N.N. Taleb 


We are looking for f(.), the probability distribution function of the distribution of 
z = max (| — y|,|-2 — y + 1): 


1 
4— 4z eet 


f) = 4z Gerz 


0 elsewhere. 


Waiting for a Train 


1 
There is a probability of 6 of the traveler missing train A by 10 minutes. The distri- 
bution of the waiting times is shown in the following table: 


10|30| 50 | 70 | 90 | 
lee ae ale =| 
TEI ; ; 


| Time in min 


| Probability | 


The expectation is: 


1 1 1 1 1 980 : 
10: 5 +30: 5 +50: zg t70: a, 90) = x 27 min. 


Notes 


A curious thing about this riddle is that, assuming that the traveler comes a little 
earlier as not to miss train A, say at 8:05, then the expected waiting time will grow: 


Average Number of Runs in a Sequence of Random Numbers 


Solution 1 

A run can start at any one term of the sequence but the last one and there is always 

a run that starts at the very first term. If pẹ is the probability of a run starting in 

position k, then clearly pı = 1 while pm = 0. The average number of runs is simply 
M-1 


M 
R= S pr, or R=1+ X pr. 
k=1 k=2 


Let nz be the number in the k*” position. The probability p, of a run starting at 
position k is complementary to the event that, in magnitude, nę is between its two 
neighbors: either Nk—-1 < Nk < Nk+1 OF Nk—1 > Nk > Nki. The latter two events 
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(3) 
N(N — 1)? 


N 
( 3 ) ways to select three numbers out of N and only one way to order them in a given 


are equiprobable and each comes with the probability of because there are 


order, either increasing or decreasing. There are N ways to choose the first number 
and N — 1 ways to choose each of the other two. Thus, the probability of a run starting 
at the kt? position is 


(3) 
N(N —1)(N -2 
Pr=1—2 3 =1-2 ( 
N(N — 1)? 3!- N(N — 1)2 
=i N-2 
~~ 3(N =T) 
= 2N-1 
~ 3(N—1) 
The average number of runs, then, is simply 
M-1 
2N—1 
R=1+ X py =14+(M -2) 


3(N — 1)’ 


Solution 2 
Joshua B. Miller 


An alternation is a transition from an up run to a down run. The number of runs is 
one plus the number of alternations: 
M-1 


M-1 
R(m,...,2m) := 14+ 5 |z; < min{xi—1, zi+1}] + 5 |x; > max{2i_1, £i+1}] 
i=2 i=2 


M-1 

E[R(a1,...,@u)]) =14+ 5 Pr (x; < min{£i—1, Zi+1}) 
i=2 
M-1 


+ 5 Pr (x; > max{zi—1, 2i41}). 
i=2 


Note 1 
N-1 


Pr (a; < min{a;-1, %i41}) = 5 Pr(min{zi—1, 2:41} > k |a; = k)Pr(a; = k) 
k= 


© (34) & 


_2N-1 
~ 6N-6' 


where the second line follows from conditional independence. 
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Note 2 


Pr (x; > max{xj_-1, 2i41}) = Pr(max{xj-1, ®i41} < k |x; = k)Pr(z; = k) 


IMs IM: 


Combining these together, 


2N —1 
E|R ... zM) =1+2(M -2 
[Renem] =1+2(M - 2) — 
2N—1 
=1+(M-2 : 
( 3N —1) 
Note 3 
2-2-1 ; . 
For N = 2, the formula reduces to 1 + (M RETE 1) = M —1, implying that the 
experiment is only possible if M < N. There are just two possible outcomes: 1, 2,...,N 
and N,N —1,...,1. This is due to the stipulation that no two successive selections 


may be equal. If the stipulation is removed, then the riddle becomes equivalent to 
finding the average length of the runs in coin tossing, which, as we know (Riddle 


M+1 
on pagefios), equals Z : 


Training Bicyclists on a Mountain Road 


Solution 1 
A. Bogomolny 


Let q = 1 — p be the probability that a bicyclist will not tumble on his own. Then the 
probability that bicyclist number k (counting from the head of the line) does not fall 
equals q*. 

Let Ip be the indicator function for the event that bicyclist number k tumbles down: 


1, if the kt? bicyclist tumbles 
Ik = , 
0, otherwise. 


Then P(Ip = 0) = q" so that P(Ip = 1) = 1 — q" and E(x) = 1 — q". 
We are interested in the expected number of bicyclists who eventually tumble, E(n): 


n n 


a(n) = XC E(k) = XC- 4) 


k=1 k=1 


l= _,_i-p 


=n q 
1-q p 
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Solution 2 
N.N. Taleb 


n tumble with probability p (after the first does). n— 1 tumble with probability p(1— p) 
(in the case where the second one tumbles). n — 2 tumble with probability p(1 — p)? 
and so on. 1 tumbles with probability p(1 — p)"~1. Summing up, 


i —1+(1- pt! +p+np 
Fae e SP) | 
1=0 p 
Solution 3 


Alejandro Rodríguez 
The probability that cyclist 2 is the first one to fall is equal to the probability that the 
previous i — 1 cyclist did not fall, (1 — p)*~!, multiplied by the individual probability 
of falling p: 
P; =p(1—p)**. 
If cyclist i falls then the remaining n — i clyclists fall too for a total of x; = n+1—i. 
The expected value of x is then equal to 


i=1 


Substitute and split the sum: 


n 


E(x) = >20 -p(n +1- i) 


n n 


= [m+ C0 -p - oi - p)'). 


P i=l i=l 


Now, we use the following results: 


@-p0-01-p") @-p0-0-p”) 
1- (1-p) (1 -(1-p)}? 
n(1 — eee 
1- (1-p) 


Elz) = — je +1) 


which yield 
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Number of Trials to First Success 


Solution 1 


We shall call an occurrence of V in a trial a success; a trial is a failure otherwise. 
The event will occur on the first trial with probability p. If that fails, it will occur 
on the second trial with probability (1 — p)p. If that also fails, the probability of the 
event coming up on the third trial is (1 — p)?p. 
More generally, the probability of the first success on the n* trial is (1 — p 


We are interested in the expected value: 


yr" p. 
E=p+2(1—p)p+3(1—p)*p+---+n(1—p)™ ‘pt... 

One way to determine F is to use a probability generating function: 
f(x) =p+2(1 — p)pa + 3(1 — p} pr? ++... +n(1 — p) tpr"! +.... 


By definition, E = f(1). (Note that, since, the series converges for x = 1, f(1) does 
exist.) To find f in closed form, we first integrate term by term and then differentiate 
the resulting function: 


F@)= | Kejdz = | Er + D0 = p)"pe"de 
n=0 


= ya -»)"p | (n+ 1)a"dx 


-Ea - "pe"! +0 


n=0 
P n+1,.n+1 
——) (1-p i r T +C 
1— p n=0 
p_ (1-p)r 
= C 
p iee 
px 
= C. 
1—(1-—p)x y 
Now differentiate 
P 
SPE S aT 
; p _ 1 : 
from which f(1) = — = >, as required. 
p p 


Solution 2, Shortcut 
There is a shortcut that makes the finding of E more transparent. Obviously, the 
expectation of the first success counting from the second trial is still Æ. Taking into 
account the first trial, we can say that, with probability 1 — p, the expected number 
of trials to the first success is £ + 1, while it is just 1 with probability p. This leads to 
a simple equation: 


E=p+(1—p)(£+1)=1+4+ E(1—p). 
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1 
Solving this equation for E gives E = -: 
P 


n=0 
=p )_n(l—p)"+p>_(1—p)” 
n=0 n=0 
=p n(l—p)"+p a= 
1- pp (n+1)(1- p)” +1 
n=0 
(1-p)E+1 


Waiting for an Ace 


Solution 1 
Problem 40] 


Four aces divide the deck into five parts, each of which may contain 0 to 48 cards. 
According to the symmetry principle (Appendix [B}, all five have, on average, the 


4 
same length: = = 9.6. The next card after an aceless run should be an ace—a card 


number 10.6. 
For this riddle, the symmetry principle may be formulated as follows: 


When n (uniformly) random points are picked on a segment, the n+1 parts 
so created have the same length distribution. 


Solution 2 
N.N. Taleb 


Note the successive waiting times, excluding events. We have the following 
probabilities: 


4 
l; 59? waiting time 0, 


2 1 z iting ti 1 
-3I E3 , waiting time 1, 


3. : 1 1 , waiting time 2 
50 52 51 


and so on. The expectation becomes 


48 i—1 4 48 
2 z 53) = 5 
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The expression could also be written as 


: i 4 
a7 4i +1) TT <0 (1- =) 48 
52—1—i Fe 


i=0 


The former seems self-explanatory. 


Notes 
N.N. Taleb, M. Wiener 


For a total sample size of n with the initial probability of p, the expected time to the 
first success is 


n+1 
se ieee 


1 
Note that, as n — ov, this converges to the solution — of the aforementioned 
P 


problem with replacement. 


Two in a Row 


Solution 1 


Let Xn stand for the result of the n* toss. We will look into the more general problem 
of having to stop at the n* toss. This would mean X, = Xn_1 = H and Xn—2 = T. In 
addition, the sequence of the previous tosses should not contain two consecutive heads. 

Such a sequence may start with either T or HT, and the remainder ought to 
be a sequence with the same property—no HH. Thus, we get a recurrence: Sk = 
Sk—1 + Sk—2, where S is the number of sequences with no HH of length k. Obviously, 
Sı = 2, S2 = 3, meaning Sk = Fy42, where Fn is the Fibonacci sequence 


0, 1,1,2,3,5,8, 13, 21,34,... andO=F. 


There are 2” possible outcomes for n tosses of a coin. Of these, S,—3 end with 
THH and have no repeated heads beforehand. Therefore, the sought probability is 


Fn- ; : 34 
P, = J A which for n = 10 gives Pig = Tod" 
Solution 2 
Amit Itagi 


Let Qg be the event that two consecutive heads do not occur in k consecutive tosses. 

A Qg event ends in one of the following patterns: TH, TT or HT. Suppose the coin 
is tossed one more time. The probability of Qk+ı would be the same as Q, if not for 
a possible ending pattern of THH arising from the TH ending pattern in Qk. Thus, 
we need to subtract the probability of the THH ending. Hence, 


P(Q) = PQr) — PQr). 


Solving for this recurrence relation, 


P(Qk) =c y + C2 
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C1, C2 and c3 are undetermined coefficients that can be solved for using the initial 
3 5 
conditions: P(Qi) = 1, P(Q2) = 7 and P(Q3) = 3 (obtained by enumerating the 


cases). This results in 


(5 — 3v5) [e2] (5 + 3v5) 


10 


(1+ V5) 


k 
| A ; 


P(Qk)= 


10 4 


In order to end with two consecutive heads for the first time on toss number 10, 
Q7 has to happen followed by THH. Thus, the required probability is 


1 pg = 62375) (1— v5) S (5 +3v5) | (1+ v5) o 17 
g tere BO 4 80 4 ~ 512° 
Solution 3 
N.N. Taleb 


We have the following Markov chain: 


© 


© 


It has the following transition matrix M: 


N H HH 
1 1 
N = 2 
5 9 ” 
1 1 
H -= 0 = 
2 2 
HH 0 0 1 
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The matrix M to the itè power is 


AEA |v)” 


V5 | V5 
G05)  (G(-¥9)) 
V5 V5 


272-2 (v5-1) (v5+1) HIO gaia (1-v5) (v5-1) (v5+1) 


V5 V5 
azai (V541) (1-VB)) 2-8 (v5-1) (V5) 


a iy fis í —2i—2 Ea 
a-a (y5-1) (1-v5) 22 (v5 +1) 2 


v5 v5 
EA E 
v5 v5 
1 


The matrix M after i steps, starting from position 1, is 


52-71 (5.241 4 (BVB— 5) (1 — vB)’ — (5 + 3V5) (1+ v8)') 


Hence, the cumulative probability P of having HH in i steps or fewer is 


P(i) = oS ((3v5 5) (1 v5) L5 22H (3v5 + 5) (v5 + 1)') l 


which yields the mass generating function, 


p = P(i)- Pli —1) = Pa (v3 +5) (1 v5) (v5 -5) (v5+ i). 
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By the Binet identity, where F is the Fibonacci number, 


1 


(V5 +5) —(v5—5)' 


Fi- = l 
1 2i-1 4/5 
POE eh 
Here 
34 
ATT 


The unconditional probability of being in the second game (conditional on both 
winning previous games but regardless of having been matched) is 


n 
1 22 
P2 = z2 


n 
2 
2 


The unconditional probability of being in the it” game is 


n 

1 2' 

Pi = Ji 7 n 
gi-1 

2 


p, =(1 Pi-1) 53 ee pipe = pi = p2)p3 
2i 
2 
m k—1 
P(n) =X px [0-2 
k=1 g=1 
logn 
log 2 


Solution 4 
Giorgos Papadopulos 


For 10 tosses, the ninth and the tenth should be heads (H). Therefore, we concentrate 
on the first eight tosses: 


1. All tails: (5) =. 


2. One head, seven tails. There are seven possible positions of H before and after 


T, excluding the very last, after all tails: (1) =7. 
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3. 2H, 6T. With the same reasoning as above, there are six distinct positions for 


H between T’s, including the front: H =15. 


4. And so on. 


The total probability is the sum of the cases above divided by 21°: 


pro = 9D a 


210 ~ 1024" 


In general, the formula is 


Poe S (" i J 


Waiting for Multiple Heads 


Solution 1, Case n = 3 


For the first appearance of three heads in n tosses, 
1. the last one needs to be a heads, 


2. the total number of heads needs to be three (in the limit, the rare event of no 
heads will play no role), 


3. the first two heads could be in any of the first n — 1 tosses. 


Thus, we get the expectation of the first occurrence of three heads: 


<= n—1)(n-—2 aa 3 
Bale) = Son ay tet =D 


(n — 1)(n — 2) 


z x”"—*, Integrate three times from 


Co 
To see that, introduce f(x) = 5 n 
n=3 


3 
3 
0 to x to get > oS TE Differentiate three times to find f(x) = eas Then 
r -zr 
n=3 


3 
substitute z = 1 — p and multiply by T 


Solution 2, Case n = 4 


For the first appearance of four heads in n tosses, 
1. the last one needs to be a heads, 


2. the total number of heads needs to be four (in the limit, the rare event of no 
heads will play no role), 


3. the first three heads could be in any of the first n — 1 tosses. 
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Thus, we get the expectation of the first occurrence of four heads: 


Follow the steps described above, but with four integrations followed by four 
differentiations. 


Solution 3, Markov Chain 
Josh Jordan 


N 
It seems reasonable to conjecture that on average one will need to wait — flips before 
p 


getting N heads the first time. I will describe a process of obtaining this result for 
N = 5 that will make the assertion obvious. The process is based on constructing the 
Markov chain that describes flipping a coin and counting the number of heads. 


There are six nodes, corresponding to the number of heads from zero to five. The 
one at the bottom corresponds to a zero count of heads. This is, say, state 1. We move 
from one state to the next with the probability of p and stay in the same state with 
the probability of 1 — p. 

There are six variables u, v, w, x, y, z corresponding to the average number of 
coin flips that are yet required to get to the last state of having five heads. These six 
variables are connected as in the following system: 


w= 1+ (1 — p)w + pe 


v= 1+ (1-— pv + pw 


u =1+(1-— pu + pv. 
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The free coefficient 1 tells us that the count of heads changes with every transition 
from state to state. The system can be rewritten: 


py=1 

px = 1 + py 
pw = 1+ pr 
pv = 1 + pw 
pu = 1 + pv. 


A step-by-step substitution from the top down shows that py = 1, px = 2, pw = 3, 
pv = 4, pu = 5. The first four confirm our previous derivations; the last one confirms 


5 
the conjecture, E5(p) = u = —. The way it was obtained shows how the general 
P 


conjecture can be proved by induction. 


Solution 4, Markov Chain by Mathematica 
Marcos Carreira 


stts = {"Start","Ts","TsH", "TsHH", "TsHHH", "TSHHHH"}; 


0 1-p p (0) (0) 0 

0 1-p p 0 0 0 

0 0 1—p P (0) 0 
tmat = ; 

0 0 0 1—p p 0 

0 0 0 0 1—p p 

0 0 0 0 0 1 


proc = DiscreteMarkovProcess[1, tmat]; 
MarkovProcessProperties[proc]; 


Graph[stts, proc] 


Grid[Table[{j — 2, stts|[j]], Mean[First PassageTimeDistribution[proc, j]]}, 
{j, 3, Length[stts]}], Frame — All] 
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1 TsH l 
P 

2 TsHH 2 
Pp 
3 

3 | TsHHH | =- 
Pp 
4 

4 | TsHHHH | — 
Pp 

Solution 5 

Amit Itagi 


Let q = 1 — p. Suppose we get k heads for the first time after n flips. Then the first 
n — 1 flips should have exactly k — 1 heads and the last flip has to result in heads. The 
probability of this happening is 


Z(n,k) = C(n—1,k — 1)p g". 
Thus, the expected number of flips before getting k heads (assuming k > 0) is 


f(k) =X nZ(n, k) =X nC(n-1,k - 1)p*qr* 
n=0 


n=0 
= = k k n-k 
=} n |—C(n,k)| p*a 
n=0 n 


= kp" XL O(n, k)" 


n=0 
k co 
= n(n—1)...(n—k+1)q™* 
` n=0 
k k © 
et Se a 
-u-t 
Ja p æ ı 
— (k=1)! ôg 1-q 
3 93 
p os -l 
m= 2! Og 1-q 
oP E 
2! (1-44 p 
4 94 
po 1 
ijen 
F(A) 3! ðqt1 -q 
pt Al 4 


I= p 
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Solution 6, Mathematica Again 
N.N. Taleb 


Generalization of a problem: flipping a coin, with probability of heads p, 0 < p < 1, 
what is the expected number of flips before getting 3,4,m heads? 


Nonconsecutive Heads 


For three nonconsecutive heads, we denote by x an event we do not count and H the 
heads on the coin: 


x tH xHxH xHxrHxrH 


x l-p p 0 0 
xH 0 l1—p p 0 
xHxrH 0 0 l-p p 
cHxHxH 0 0 0 1 


with transition matrix 


1l—p p 0 0 

0 l-p p 0 
T= 

0 0 l-p p 

0 0 0 1 


= 0 (1—p)” n(1—p)""*p 
0 0 (1-— p)” 
0 0 0 


1- 30- p)" = 2)p((m = Lp + 2) +2) 
(np +p- 1)(1- p) +1 


1- (1- p)” 
1 


We have the transition probability from the initial state Sọ = x to Sn = xH£HxH in 
n throws: 


P(SalSo) = 1- Z0- p)" — 2)p((n — 1p +2) +2). 


Hence, we can derive the probability mass function (pmf) of the stopping (first 
passage) time distribution (indexed for three heads): 


(n= 2)(n- 1)? (1 -p"™, n>3 


1 
3,n = p(Sn—-1|S0) — p(Sn| So) = 2 


and the expected stopping time: 


Co 


3 
So non = rs 


n=3 
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For four and more nonconsecutive heads, 


1l-—p p 0 0 0 
0 l-p p 0 0 
T4 = 0 0 l—p p 0 
0 0 0 l-p p 
0 0 0 0 1 
1 
and ¢4.n = —(n— 3)(n — 2)(n— 1)p*(1 — p)"~*. The expected stopping time is 
: 6 
= 4 
X NDA,n a 
n=4 p 


Generalizing, for m heads, 


1 
m! 


bmn = — (n - m)(n =m — 1)... (n — 3)(n — 2)(n — 1)p” (1 — p)”, 


or, more compactly, using the Pochhammer symbol for the rising factorial, 


1 m n-m 
m,n = m” _ M)mP (1 —p) G 


Finally, 


Co 


5 NỌm,n = = 


n=m 


Consecutive Heads 


For three consecutive heads, 


x H HH HHH 


x l-p p 0 0 
A 1l1-—p 0 p 0 
HH 1-p 0 0 p 
HHH 0 0 0 1 


The matrix and the pmf of the first passage time distribution are unwieldy, but we 
get for the expected stopping time 
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Probability of Consecutive Heads 
Signor Ernesto 


As was observed in the previous riddle (see also (92}), the expectation of the number 
of tosses until 10 successive heads is 


10 


10 ~ (pm — p10" 
p (p — 1)p 
pt? =Í 
We are looking for P that satisfies E = @—1p™ = 100. This can be found 
Pp— 1)p 


approximately: FE ~ 0.712212. 


Two Dice Repetition 
Problem 409] 


Solution to Question 1 


This is very much like in the Birthday Problem [97], but with 36 distinct equiprobable 
selections. 

A repetition occurs on move n, 2 < n < 37, for the first time if, until then, there 
were no repetitions, but then on the nt! move there was one. The probability of this 
event is 


_ 35 34 36-(n-2) n-1 
~ 36 36 36 36 
_ (n= 1)36 - 35- (38 — n) 
7 36” 


p(n) 


The expectation for the number of moves is then 


37 
E = X _ kp(k) ~ 8.203. 


k=2 


Solution to Question 2 
Now, for n, 2 < n < 32, the first repetition can occur at move n in one of two ways: 


1. there was no x in the first n — 1 moves or 


2. there was exactly one x in the first n— 1 moves, whereas x may or may not appear 
on move n. 


Let q(n) be the probability of the first repetition happening on move n. Then, to 
start with, 


30 1,6 6 1 


J=. eee 
a2) = 35 ` 36 + 36` 36 7 216 
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For 3 < n < 32, 
_ 80-29---(82—n) n-1 
7 36"-1 36 
6 30-29---(33-—n) 6+(n—2) 

1)- : . 

(n-1)- 35 367-2 36 
30+ 29--- (33 — n) 
B 36” 


q(n) 


. (n — 1) (5n + 56). 


The expected number of moves at which the first repetition occurs is then 


32 
XC kq(k) =~ 6.704. 
k=2 


Expected Number of Happy Passengers 


Solution 1 
Konstantin Knop 


Assuming the plane seats n passengers, let F'(n) be the expectation of the number 
of unhappy passengers and F*(n) be a similar expectation, conditioned on the first 
passenger getting a wrong seat, so that the two are related as follows: 


F(n) = = 0+ —— - F” (n). 


F* (n) is thus the number of unhappy passengers in an n-seat plane with an a priori 
misplaced seat. 

Let us now start from the beginning, but counting passengers from the end of the 
line. When the nt! passenger (the one without the boarding pass) enters the plane, he 


1 
chooses any of the n available seats—in particular, his own—with probability —. With 
n 


the same probability, he lands in the seat of the kt passenger (from the end of the 
line). If this happens, all the passengers entering before the kt? passenger will be happy. 
The latter will have a choice of k seats, of which one is that of the first passenger. With 


1 
the probability of —, he will choose the latter, making the total number of unhappy 


k-1 
passengers two. With the probability of F he will choose a seat of one of the 


passengers yet in line, adding one to the count of unhappy passengers. This would 
reduce the problem to that of a k-seat plane (with one seat a priori misplaced), which 
may be described as adding to the total expectation of unhappy passengers the quantity 


2 k-1 ean cay R+1, kal 
pt a eee ea r F(E) 
_ k+l, 


—— + F(R). 
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Summing up for all possible choices of k, 


ra= iot S (EH 4 mw), 


k=1 


raS (Aer H) 


For F(n — 1) we have a similar expression: 


a-D Fn => (EH rw) 


Substitution of the latter into the former produces 


TA T E rH +F(n—1), 


or 


F(n) = F(n—1) + —. 


With F(1) = 0, we are able to solve the recurrence: 
0-4 
k’ 
k=1 
so that F (100) ~ 5.177. 


Solution 2 

A. Bogomolny 

Passenger number n, counting from the end, gets his/her own seat with the probability 
n 

n+1 

matter whether the nt? passenger’s seat was among them—another previous passenger 

could have taken it. See also solution 4 in the predecessor riddle on page|158}) Define 


. (This is because the previous passenger was facing n+1 choices, and it does not 


and value 0 


the random variable X(n) that takes value 1 with the probability k i 
n 


n 
n+l 


1 
with the probability PR Then the expected value is E(X(n)) = 
n 


1 
The first (to board the plane) has the probability of T0 to be happy. Then (using 
the linearity of the expectation) the expected value E(100) of happy passengers is 


99 
n 1 
100) E(X — 7 94.823. 
Hh gt LM )= dot + T 


The number of unhappy passengers is then about 100 — 94.823 = 5.177. 
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Expectation of the Largest Number 


Solution 1 
The following is an anonymous letter submitted as a solution to Riddle 


Dear Alexander, 

I am writing about your symmetry principle (Appendix [B) as you applied it to 
the riddle of the expected number of cards to be dealt in order to get the first ace 
(Riddle on page (194). 

I came up with the probability puzzle (this riddle) and had been working on it. 
After tediously solving it by normal means, I realized that my problem was the same as 
your ace riddle. I used the same symmetry-principle logic as you did to solve your ace 
riddle: the n drawn balls divide, by rank, the remaining N —n balls in the urn into n+1 


. N-n 
groups of average size 
n+1 


. The last group is from ball number Fes “(N+ D| +1 
n 

to ball number N. The ball just before the first ball of the last group is the expected 

highest ball drawn: 


E(max) = Fee (N+ D| l 


It occurred to me that a little more can be squeezed out of these riddles with more 
use of symmetry. By symmetry, the expected minimum is given by 


which is the same as the answer to your ace riddle. The formulas for expected minimum 
and maximum can be generalized to any rank: 


k 
n+1 


BK) =| wn], 


for k = 1 to n, where k is the rank from smallest to largest. 


Solution 2 
Amit Itagi 


For k to be the largest ball, choose k and n — 1 balls from the first k — 1 balls. Thus, 
the required expectation value is 


p Zelaa) = a N a AN 
o OC) 
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Solution 3 

N.N. Taleb 

Drawing without replacement, we can make use of the hypergeometric distribution for 
the distribution of Z, the max of an n-size subsample of 1,2,..., N with mass function 
P(Z = z) = ¢(.): 


N-z-1 
el ey = Woe 


The expectation can be written as follows: 


2) = YON- D, N, d) TT 0-60.59) 
i=0 j=0 
I(-N) 
n (nv cea ira) ntN+1) 
i (n+ 1p 
o nN FOL nsin(an)D(n + 1) ese(aN) sin(a(n — N))P(—n +N +1) 
= n+1 nti m(n+1)P(N +1) 
=0 

o n(N +1) 
= n+l 


Drawing with replacement, we have a discrete uniform distribution with cumulative 


F(n) = £. The cumulative probability of Z is, by standard argument (all x below z), 


F(Z>2)= (=). 


The expectation becomes 


Waiting for a Larger Number 


Solution 1 
Talking of probabilities, we disregard the possibility of getting two equal numbers as 
having zero probability. Further, 

P(N >n) = P(So > S1, So > S2,...,59 > Sn) 


because, from the definition of N, none of the previous numbers exceeded Sọ. Put 
another way, 
P(N > n) = P(max{ So, S1, So, sae Sn} = So). 


Now, here is a crucial observation: all n + 1 numbers are uniformly distributed on 
[0, 1], so there is nothing special about Sọ, and, in principle, any of them could be the 
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largest. Thus, P(N > n) = 
of symmetry. See Appendix [B] 
P(N =n)= P(N >n- 


To continue, 
E(N)=¢ n : 
7 n(n +1) 
n=1 
strange as it may appear. 


On one hand, P(N > n) 


n+1 


. (This is an instance of invocation of the principle 
1 

) Similarly, P(N > n-— 1) = —. Now, 
n 


14 
n+1 n(nt+l1) 


1) — P(N >n) => 


Co 


1 
= eae 


n=1 


1 
= ea indicates that not finding a maximum after, 
n 


1 
say, 99 attempts has a fairly low probability of T00' On the other hand, the expected 


waiting interval for that to happen is infinite. 


Solution 2 
Josh Jordan 


By symmetry, E[N] is unchanged if we stop instead when Sy < So. For any given So, 


this takes (Ridatefs. fon page 


1 1 
194) — tries on average, so E[N] = —. Finally, since 
So So 


So is uniformly distributed over [0,1], we have 


E[N] = (area under the curve — for 0 < So <1) = œ. 


Solution 3 
Amit Itagi, Marcos Carreira 


The integral of (1 —x)(1+ 2x4 


is just the derivative of 1 + £ + xz? +... with respect to x = 


1 
from 0 to 1 diverges. 
2 


Solution 4 
N.N. Taleb 


1 
So 


| 3x7+...) from 0 to 1 is the solution. The second term 


1 
iea a The integral of 
=f 


The probability of first exit on the first shot is P(S1 > So) = 1 — So. The probability 

of first exit on the second shot is P(S1 < So, S2 > So) = So(1 — So). The probability 

of first exit on the t! shot is P(S1 < So,..., 5-1 < So, St > So) = Ss — So). 
The conditional stopping time (knowing So) is 


E(N|So) = XC tS (1 — So) = 
t=1 


1 
1— So 
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The unconditional stopping time (not knowing So) is 


: . H der i ; 1- L 
lim lim = lim lim ln | —— ] = œ 
H>1L>0J,p 1l-—x H>1L>0 1-H 


Waiting to Exceed 1 


Solution 1 
Marcos Carreira with Mathematica 


Refine[Table[CDF[UniformSumDistribution[n], x], {n,1,6}],0 < æ < 1] 
7 L? z? gt z gê 
? 2° 6’ 24’ 120’ 720 
Table[CDF[UniformSumDistribution[n], 1], {n, 1, 6}] 
1 111 1 1 
"2’ 6’ 24’ 120’ 720 
1 — Table[CDF[UniformSumDistribution[n], 1], {n, 1, 6}] 


0 1 5 23 119 719 
"2? 6’ 24’ 120’ 720 


Differences[1 — Table[CDF[UniformSumDistribution|[n], 1], {n, 1, 6}]] 
Ded dy) Te I 
2’ 3’ 8’ 30’ 144 

Differences[Table[CDF[UniformSumDistribution[n], 1], {n, 1, 6}]] 


B le sa 
2°38’ 30’ 144 


pen (in =a (ao is si) 


—l +e 


Solution 2 
Amit Itagi 


Let Ta = J ko Sn: 


1 1—To 1-Tı 1—Tn-1 
P(T, < 1) =| I e dS;,dSn_1...dSp. 
So=0 J S1ı=0 JS2=0 Sn =0 


Let us make an observation here. For some natural numbers k and j, 


1=Tk-1 (1 — OA Pet (1 — Tpi — Sp) 
ee 
Sp =0 J: Sp =0 J: 
z Anaa 74 
(j +1)! o 
(1 — Tk-1)t! 


(j +1)! 


(8.1) 
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Note, this result is also valid if 7 = 0. 

(1- Th)! 
j! 

with j = 0. Thus, the integrand keeps propagating through the integrals from right to 


left with k decrementing by 1 and j incrementing by 1 with every integral. Thus, we 
are left with 


1 al 
(1 — Tp)" | (1 — Sp)” 1 
P(T, <1)= =o dS) = Sn 1s. =. 
( ) Ja n! ae M ° (n1)! 


The integrand for the rightmost integral in equation. ean we written as 


The desired expected value is 


E(n) = S eP <1NT,, > 1) 
= Yon: [P >1)-P(T,-1 > 1NT, > 1) 


= 5S n- [P(Tn > 1) — P(Ta-1 > 1)] 


al ee 

— & n! (n+1)! 

= 1 1 9 1 1 

mie A ee 
1 1 1 
= (2 1)-5 +6 2): a+ 
1 1 1 

= Po ag ee Le 


Solution 3 
pp. 94-98] 
Let Tn = J p-o Sn and N be the first value of n for which Tn > 1: 


P(N > n) = P(o <1,T% <1,...,Tn <1) = P(T, < 1). 
It follows that 
P(N =n)= P(N >n-1)-— P(N >n) 


= P(Tn-1 <1)— P(Tn < 1). 


Chapter 8: Solutions 225 


We also know that P(N = 0) = 0. Thus, 


EN) = Jo nP( nS ne G3 < 1) = P(T, < 1)) 


n=1 n=1 
oo 

= 5 nP(Ta-1 < 1) -Xr P(Tn < 1)) 
n=1 n=1 


a P(T, <1)— S nPI, < 1)) 
n=0 n=0 


1 
Now, P(To < 1) = P(So < 1) = 1. By induction, P(T,, < 1) = miD so that 
n ! 


Co Co 


BN) =O PCa <= aay seb 
n=0 ` 


n=0 


Notes 


Previously T„ was defined as the sum of n + 1 random variables. In a more common 
formulation, we would consider T, as the sum of n random variables. In that case, 


1 
the result would appear slightly different because then P(T, < 1) = —, 50 that we 
n! 


would have 


Solution 4 
Problem 100] 


Call E(x) the expected number of number selections for the sum to exceed x. Since 
the first number is uniformly distributed on [0, 1], 


sais | B-d 


but E = 0 for x < 0, so if 0 < x < 1 this relation can be converted into 


Be)=14 f Bæ- ùd =1+ f E(u 


It follows that E(x) = E(x), E(0) = 1 and so E(x) = e”. In particular, E(1) = e. 


Waiting for All Six Outcomes 


Solution 1 


1 
Waiting time for an event that occurs with probability p is known to be — (Riddle 
P 


on page [194). 
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Assume we already saw k < 6 numbers. The probability of seeing one of the remain- 


6 
6—k 
Naturally, we start counting from k = 0 and the first roll of a die is always a success: 
one of the numbers shows up. Knowing that the expectation is additive, the expected 
number of rolls until all six numbers show up is 


5 
6 1 1 1 1 1 1 
Dae Reg tes 5 +t) <r. 


6—k 
ing 6 — k numbers after that is aan making the waiting time equal to 


Solution 2 
David Koops 


Let t(n) be the number of throws until all outcomes appear, starting from n distinct 


n 
outcomes. On the next throw, with the probability p we do not get an outcome that 


6-—n 
we have not seen before, and, with probability eo we get a new outcome. Solve 


t(n) =14 = t(n) ' Eon t(n +1) 
for 1(0) with ¢(6) = 0. Explicitly, t(n) — t(n + 1) = ——. Thus, 
i(5) - 16) = Í 
i(4) - (5) = $ 
1(3) -t = $ 
(2) -(3) = Ê 
TETOR 
0 -= 5. 


5 
6 
Summing up, t(0) = > T 14.7. 
-n 
n=0 


Walking Randomly—How Far? 


Solution 1 


If x Æ 0 for some n, then an+ı = an because, with equal probabilities, x changes to 
either x + 1 or x — 1. If x = 0, then necessarily |x| grows by 1. 


2k 
If n is odd, the probability of x = 0 is 0. If n = 2k, there are ( k ) equiprobable 


ways to have the same number of tails as of heads. Thus, the probability of x = 0 if n 
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is even is 


NIZ z 


Qn 
This leads to the following recurrence: 


Gn; if n is odd 


An+1 = 


wl ss 


an + ~~, if nis even. 


Qn 


Solving that, we obtain 


2k 
It is not hard to roughly estimate the central binomial coefficient ( k ) It is the 


2k 
2k 2k 
largest among 2k+1 coefficients in the sum 5 ( ) = 2” so that ( k ) (2k-+1) > 2”. 


Ss 
s=0 
2k 
k 1 


——. We conclude that an — oo, for 


In other words, 92h > Sd 


(k) 

“\k 2 Í 
ae Sram 
k=0 k=0 


and the latter series is known to converge to infinity. 


Solution 2 
Amit Itagi, Hélvio Vairinhos, Keith Dawid 


Let y be that random variable that represents the incremental change in x because of 
a coin toss. Thus, y takes the following two values: —1 and 1. It has a mean of u = 0 
and a variance of g? = 1. The central limit theorem implies that after a large number 
n, of coin tosses, x has a normal distribution with mean ff = nu = 0 and variance 
a? = no”. Thus, z = |z| has a half normal distribution with expected value 


O V2 ovyn | 2n 
= yr yrn ON 


As n > œ, an > œ. 


an 
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Solution 3 
N.N. Taleb 


Let x be Sn, the summed variables; let X follow a binomial with probability density 


function n(x) = a Š i 
n-2£ 
We have 


We have 


lim = 


2 
In other words 
aaa Lf 
sas IAD = Vo 


Solution 4 
N.N. Taleb 


The binomial probability density function is 


n(x) = 27" a 


n 
Adjusting for 2 le — n to adapt it to an absolute sum, we have the characteristic 


function WV: 
Ux(t) = X go C) (exp (i2: (z = 5) 
x 2 
z=n/2 
n 1 nn+2 
Tep = F,(1.-—: — e2it 
G T 241 ( f g? 2 3 ) 
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and the mean 


n 1 
rlo+e 
or(5 +5) 


—iV'y (t)|r=0 = 


as before. 


Solution 5 
N.N. Taleb 


The characteristic function of a Bernoulli distribution that delivers {—1,1} is Y x(t) = 


1, 
e +e) = cos(t). Hence, the moment of order p of the sum is 


_ „p OP cos” (t) 


Mp =i : 
£ OP lio 
with mi = 0, mz = n, m3 = 0, m4 = n (3n — 2). 
m 
Using kurtosis as a benchmark of the speed of convergence to Gaussian, s = 
Nhe 


2 
we have k = 3 — —. 
n 


For n large enough, the distribution is effectively Gaussian. Let y(x) be the normal 
distribution with mean 0 and standard deviation yn. We have 


f. kee) te =f? va 


as the converging expectation. 


Expectation of Pairings 


Solution 1 
In general, let us consider a group of m boys and n girls. Let P, be the probability of 
having a couple boy/girl or girl/boy in positions (k,k + 1), k = 1,2,...,m+n-—1. 
From the point of view of chance, all Pẹ are the same; Pk = P, k =1,2,...,m+n-—1. 
Thus, similar to a problem of coin tossing (Riddle [8.3] on page [192}, the expectation 
E in question is E = (m +n — 1)P. 
To find P, note that once a boy and a girl find themselves next to each other, 
n+m—2 
—1 
multiplied by 2 because we do not distinguish the order in which the selected couple 


the remaining boys and girls can be arranged in ( ) ways. This should be 


: n+m ; 
comes in the sequence. On the other hand, there are ( ) possible arrangements, 
n 


implying 
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We conclude that 


2 
E=(m+n-1)P=—" 


ntm 


4 
For m = 8 and n = 12, this gives = 


Solution 2 


There are 


m 

ways to choose a boy and a ways to choose a girl from the 
n-m 

remaining group. The probability of their being next to each other is 


m m = 2nm 
n+m n=-m-1 (n+m)(n+m-1) 


This could happen in n — 1 positions, making the expectation equal to 


2nm 2nm 
E= 1 i = 
venues) (nt+m)(n+m—1) ntm’ 


the harmonic mean of m and n. 


Making Spaghetti Loops in the Kitchen 


Solution 1 
P. Winkler 


We compute the probability of making a loop at every step and then, using linearity 
of expectation, obtain the expectation by summing up the probabilities. 

Having tied k—1 knots, you pick up an end, and, of the remaining 100—2(k—1)—1 = 
101 — 2k ends, you can choose another to tie to this one. Only one choice (the other 
end of the chain) results in a loop. It follows that the probability that your k*t? knot 


1 
makes a loop is Tor = 9k? and the expected number of loops is 
1 1 1 1 
pte by 2D. 4 
99 + 97 + J5 tere I 93777485, 


which is less than three loops! As n grows, the expected number of loops approaches 
the nt” harmonic number H,,. 


Solution 2 
A. Bogomolny 


Let N(m) be the expected number of loops after tying m knots. Assume we have 
tied k — 1 of them so that 102 — 2k loose ends remain. We pick one arbitrarily and 
then another one from the remaining 101 — 2k to tie to the first selection. With the 


we will get an additional loop, and, with the probability of 


1 
bability of ——_ 
probability o 101 — 2k’ 


100 — 2k 
A the number of loops will remain the same. Thus, we get a recurrence relation: 
100 — 2k 1 1 
NWS ea O ea a og: 
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Unfolding the recurrence, we get 
1 1 1 1 
N(k) = — + + — +- + = & 2.93777485. 
(k) 99 97+ 95 "I 


Repeating Suit When Dealing Cards 


The two cases are best represented with tree diagrams. 
Drawing cards without replacement: 


In the first case, the expected number of cards is 


12 39 24 39 26 36 39 26 13 _ 68053 


3.27. 


2. aa y KEL nd E x 
TA 51 T 51 50 49 7 51 50 49 20825 


In the case of drawing with replacement, the expectation is 
1 3 2 4 3 2 36 3 2 1 103 


+3 | 


4 4 4 4 4 49 4 4 4 32 enact 


Family Size 


Solution to Questioins 1 and 2 
The first two questions are two specific cases of the general riddle of “Number of Trials 
to First Success” (Riddle [8.9] on page |194). With equal probabilities of birth for boys 


1 1 
and girls, i.e., with z the expected waiting time is — = 2. 
P 
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Solution to Question 3 


Assume p is the probability of having a boy, 1 — p that of having a girl. The sample 
space for our riddle consists of just two kinds of sequences: 


BB... BG and GG...GB. 


For a sequence of length n, the probabilities are 1 — p”~'p and p”~1!(1—>p), respec- 
tively, for n > 2. The mathematical expectation is then 


XOn(( -ptp + p° (1 — p)) 


=p- p) P (a-p)? +p?) 

_ 2—-p 2- (1-p) 
iced) (cer Ra cecer) a) 
pte 

= pl-p) 


1 
This result sits well with the formula E = — for the expected length of a sequence 
P 


of trials until the first success. Since the series in the latter starts with p which is 
omitted as meaningless in the present problem, we would get 


v) | e: p ») Wey 
l—-p+p 
p(1— p) 


1 
is at its minimum when p = 5 and goes to infinity as p approaches either 0 or 1. 


confirming our calculations. The graph of f(p) = shows that the expectation 


72 4 s 3 i (trom -2 to 3) 
1 ' 
For p = 5 the expectation equals 3. 


Solution to Question 4 


The sample space for this riddle consists of sequences 


GBB...BG and BGG...GB 
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that, for length n > 2, come with probabilities (1 — p)?p"~? and p?(1 — p)"~?, respec- 


tively. The expectation is then given by 


E=)_n((1—p)*p"? +p- p)""”) 
n=2 
1-p ni P S i 
= COPS np" + PY nl =) 
Pp n=0 ( ) n—2 
eer p? 
= S(p S(1-— p), 
p’ (p) (Lp) Pl 
where S(q) = 5 nq”. 
n=2 
¢ (2-4) feta, ah 
This leaves computing S(q) = (ia? so that the expectation in this case is 
given by 
g- 0P Pe—p), pe (L~p)* +p) 
pP ep) (pe p 


=(2-p)+(1+p)=3. 


This expectation is independent of p, which is a great surprise in its own right, but 
there is more. Note that the formula has been derived under an implicit assumption 
that p is neither 0 nor 1. What if, say, p = 0? What if, by a fluke of fate, women stopped 
producing boys. Then obviously the only possible child component in a family would be 
two girls. In another extreme (p = 1), all families would consist of two boys. Although 
neither case is plausible, I think that the circumstance merits being mentioned as a 
naturally occurring discontinuity. 


Solution to Question 5 


This is just a reformulation of question 3. 


Averages of Terms in Increasing Sequence 


Solution 1 
Graham Lord, Faryad Sahneh 


4 = 
The number of k-tuples X in which x; = m is & 2 & J Indeed, we are 
1> =3 


free to choose 7 — 1 numbers from the first m — 1 and k — j numbers from among 
m+1,...,n. Thus, the total number of k-tuples equals 


STR 


m=Jj 
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For the average then, 


aE (G) 
i ž E 
a2 uaea 


=g 


a eea ie 


where m’=m+landj’=j+1. 


Solution 2 
Long Huynh Huu 


First, we use the bijection 
o:S5 > S 
(£1,..-,£k) > (T2 — £1,..., En — z1, Nn + 1-— z1). 


2Sa 


res 


1 
=z > oe) 


res 


=3 ena 


res 


8! 
a 


=f. — 74. 


Likewise, we obtain Zj+1 —Z; = Z1 by using ¢ instead of ¢ where j = 1,... 


This shows that 
Tj = JX for j =1,...,k. 
Now consider the bijection 
w:S 3 S 


(@1,..-;%%) œ (n+1-—zk,... n +1- z). 
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Summation is commutative, so we get, for each j = 1,...,k, 


Tı -= Ea 


res 


= X yle) 


res 


=5 onti-m 


res 


=n + 1 — Tk 


=n +1- kū 


by equation (8.2). 
+1 


We conclude zı = a P so that a = an 


Solution 3 
Christopher D. Long 


For the number of different such sequences, consider k red 1s and n — k blue 1s, each 
of which may be placed at one of the k + 1 locations between the red 1s. This gives 


n 
( J such sequences, but it shows us more. 


There is only one location for blue 1s that increases the value of xı, but two for 
£2, etc. Thus, E(x;) = i- E(x1). What is E(x1)? 

Think of the value of each x; as equal to the number of red + blue 1s up to and 
including the i” red 1. Each blue 1 has k + 1 locations (before the first red 1, between 
the first and second red 1s, etc.). The first location increments all x;, the last none. 


Thus, the expected value each adds to xı +---+ £p is the average of 0,1,...,k, 
or =. 

2 

k k+1 

Thus, each blue 1 adds an expected value of 5 to X E(a;), so E(z1)- k- E 

k+1 k n-k n+l . nti 
k. 5 +(n—k)- 5° Thus, E(xı)=14 i ee and E(z;) =j: rI 
Solution 4 


Christopher D. Long 


Casting the riddle as k + 1 sequential urns makes it visually clearer. The value of x; is 
i+the sum of the balls in urns 1,...,i, and we are placing n — k balls in the k+ 1 urns. 


The probability that each ball increases the value of x; is k i i (placed in urn 


Z1,- -, Zi); hence, the expected value of E(x;) is immediately 
a n+1 
k)- =i. ; 
i+(n—k) ei a 
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Recurrences and Markov Chains 


...the probable, I have said, is that which is it rational for us to believe 
that the probable is true. 


John Maynard Keynes, 2014, A Treatise on Probability 


Riddles 


9.1 A Fair Game of Chance 
Problem 494] 


Alice and Bob play a fair game repeatedly for one nickel each game. If originally Alice 
has a nickels and Bob has b nickels, what are Alice’s chances of winning all of Bob’s 
money, assuming the play goes on until one person has lost all his or her money? 


9.2 Amoeba’s Survival 


A population starts with a single amoeba. For this one and for the generations there- 


3 
after, there is a probability of i that an individual amoeba will split to create two 


1 
amoebas and a 1 probability that it will die without producing offspring. What is the 


probability that the family tree of the original amoeba will go on forever? 


9.3 Average Visibility of Moviegoers 

At a movie theater, moviegoers line up to buy tickets. The ticket seller calls a patron 
viewable if he/she is taller than all the people in front of him/her in line; otherwise, 
the patron is hidden. Given that no two patrons are precisely the same height, what is 
the average number of viewable patrons among all possible permutations in a line of 
n moviegoers? 


9.4 Book Index Range 


The index of a book lists every page on which certain words appear. To save space, 
these are listed in ranges; for example, if a word occurs on pages 1, 2, 3, 5, 8 and 9, 
then its index contains ranges: 1-3, 5, 8-9. 

A certain word appears on each page of an n-page book, n > 0, independently with 
probability p. Find the expected number of entries in its index entry. 
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9.5 Probability of No Two-Tail Runs 


A fair coin is tossed n times. What is the probability that two tails do not appear in 
succession if n = 10? 


9.6 Artificially Unintelligent 
Chapter 11] 


An electronic device is being devised for transmitting 1 bit of information at a time. 
So far, the device is not 100% reliable. With the probability p, the bit is transmitted 
correctly; otherwise, it is flipped. 

n such devices are chained output to input and the first receives a bit. What is the 
probability that the bit will be transmitted correctly through the chain of n devices? 


9.7 Two Sixes in a Row 
pp. 54-55] 


What is the expected number of tosses of a fair die until two successive sixes show up 
for the first time? 


9.8 Matching Socks in Dark Room 
Problem 184] 


The colors of 10 pairs of socks range from medium gray through black. Washed and 
dried in a dark room, they have to be matched. Two socks form an unacceptable pair 
if their colors differ by more than one shade. 

What is the probability of having 10 acceptable pairs when they are being matched 
at random, due to the light conditions? 


9.9 Number of Wire Loops 
Problem 7] 


Start with 2018 pieces of wire. Next, attach each end of a piece of wire to another end, 
choosing randomly so that all pairings are equally likely (including attaching the two 
ends of a piece to one another). When this process is completed, there will be loops 
(no wire ends unattached), and the number of loops will range from 1 to 2018. 

What is the average (expected) number of loops? 


9.10 Determinants in Z2 
The problem was offered at the 1976 Student Olympiad at the Department of Mathematics 
of Moscow State University. 


Find the probability p, that a randomly selected determinant of order n with elements 


from Zz is not 0. Prove there is a limit lim py. 
noo 


9.11 Gambling in a Company 
pp. 149-150] 


In a game of n gamblers, the i‘ gambler starts the game with a; dollars. In each round, 
two gamblers selected at random make a fair bet, and the winner gets a dollar from 
the loser. A gambler losing all his money leaves the game. The game continues as long 
as possible, i.e., until one of the gamblers has all the money. 
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1. What is the expected number of rounds to the end of the game? 
2. What the probability that the it gambler ends up with all the money? 


3. For n = 3, what is the expected number of rounds until the first loser quits 
the game? 


9.12 Probability in the World Series 
Problem 1981-4] 


The winning team in the World Series must win four out of seven games. The two 
teams are equally matched. 
What is the expectation of the number of games played? 


9.13 Probability of a Meet in an Elimination Tournament 
Problem 297| 


A tennis club invites 32 players of equal ability to compete in an elimination 
tournament (the players compete in pairs, with only the winner staying for the next 
round). What is the probability that a particular pair of players will meet during the 
tournament? 


9.14 Rolling a Die 
Problem 20] 


A normal die with numbers 1 through 6 on its faces is thrown repeatedly until the 
running total first exceeds 12. 


1. What is the most likely total that will be obtained? 


2. What is the expected total? 


9.15 Fair Duel 
Problem 31] 


Smith and Brown have challenged each other to a duel. They will take turns shooting 
at one another until one has been hit. Smith, who can hit Brown only 40% of the time, 
is the weaker shot, so he will be allowed to go first. They have determined that the 
duel favors neither shooter. What is Brown’s probability of hitting Smith? 
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Solutions 


A Fair Game of Chance 


Let p(n) be Alice’s chances of winning the total amount of a + b, provided she has n 
nickels in her possession. Obviously p(0) = 0. If she is left with a nonzero capital, Alice 


1 
may, at every trial, win or lose one nickel, each with the probability of 3 


eed ee, 


In other words, 2p(n) = p(n + 1) + p(n — 1), or p(n + 1) — p(n) = p(n) — p(n — 1). 
From here, recursively, 


n> 0. 


1 
It follows that p(n) = np(1), and, since p(a + b) = 1, p(1) = ae It follows that 
a 


a 


pla) = pay 


Amoeba’s Survival 


Solution 1 


Good and general notations help solve the problem. Let p be the probability of a 
successful split for a single amoeba and P be the probability in question, the probability 
that an amoeba’s family tree is infinite. 

With the probability p, we have a second generation of two amoebas. The proba- 
bility that at least one of them will have an infinite family tree is 1 — (1 — P)? because 
(1 — P}? is the probability that both of them will perish undivided. Therefore, 


P =p(1—(1— P)’) 


because both sides of the equation represent the probability of the long term survival 
of the original amoeba. 
Simplification yields pP? + (1 — 2p)P = 0, or P- (pP + (1 — 2p)) = 0. Since P Æ 0, 
2p —1 
pP +(1—2p)=O0or P= 2. 
P 


1 
By experimental verification, if p > z there is a true population explosion that 
suggests that, in this case, P # 0. The formula we obtained also confirms that, if a 


1 
generic amoeba divides with the probability not exceeding 5 it stands no chance to 
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3 
survive forever. However, for the specific case where p = mt the probability of survival 


2 
is P= —, 

3 
Notes 


There is a nagging question whether relying on experimental results is a valid argument 
against P = 0. Let us modify the above a little. Let P,, be the probability that the nt? 
generation of amoebas is not empty. Then we have a recurrent relation: 


Prat =p(1—(1—P,)?). 


2p —1 
P 


We shall prove by induction that P, > 


. First off, Po = 1, as we are given 


2p —1 
K . We proceed in several steps: 


that an amoeba is present at the outset. Let P, > 


2p—1 
P, > 2 
p 
2p—-1 1- 
Popa aE a 
p p 
1— 2 
a-r? < (2) 
p 
es 2p—1 
1-0-3} >1- (=) = 2 
p p 
2p— 1 
Pasi =p (1- (1 -— Pa)’) os 


as promised. 


Solution 2 
Christopher D. Long 


This is a disguised biased random walk with an absorbing state at 0 which starts at 
1. If this random walk is at N, considering N moves corresponds to considering the 
1 


=1 


2 
= x by the usual 


fate of N amoebas simultaneously. This yields 1 3 


P 


ml Ww] |] 


arguments. 


Average Visibility of Moviegoers 


Solution 1 
Amit Itagi 


For n patrons, let the number of viewable patrons summed over all permutations be 


g(n) 


g(n). Thus, the average number of patrons over the permutations is ae (denote this 


by f(n)). . 
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There are (n — 1)! permutations where the tallest person is last in line. For this 
case, the sum of viewable patrons over all permutations is g(n — 1) + (n — 1)! (note 
g(n — 1) for the first n — 1 in line and (n — 1)! because the tallest patron is a viewable 
patron for all the permutations). 

If a patron other than the tallest patron is last in line, the sum of viewable patrons 
over all permutations where the last patron is held fixed is g(n — 1) (the last patron is 
not a viewable patron in this case). However, the last patron can be chosen in (n — 1) 
ways for this case. Thus, 


g(r) = g(n — 1) + (n— I)! + (n— 1)g(n — 1) 
= (n—1)!4+ ng(n — 1) 
=> f(in)= - + f(n — 1) (dividing both sides by n!). 


Noting that f(1) = 1, the solution of this recurrence relation is 


“1 
f(n) = 5 -a H,, (n** harmonic number). 
k=1 


Solution 2 


Observe that the i* tallest person is viewable if and only if he is in front of all the 


1 
people 1,2,...,i — 1. This happens with probability —. Therefore, the answer is 
i 


n 


Soe winn +9, 


i=1 


where y = 0.5772157..., Euler’s constant. 


Book Index Range 


Let rn(p) be the sought expectation. We shall show that 
Tn(p) =p + (n— 1)p(1 — p) (9.1) 


by induction on n. 

When n = 1, we have rı (p) = p, which is clearly true. 

Suppose n > 1 and assume equation [9.1] holds for rn—1(p), which is the expected 
number of ranges for an (n — 1)-page book. The addition of one more page may or may 
not change the total number of ranges. When page n is added, the number of ranges 
increases by one if the term occurs on page n and does not occur on page n — 1; this 
happens with probability p(1 — p). Otherwise, the number of ranges does not change. 
Therefore, 


Tr(p) = pl — p): [rn-1(p) + 1) + [1 — p(1 — p)] - rn-1(p) 
= Tm—1(p) +p(1 — p) 


=p + (n-— 1)p(1 — p). 
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Probability of No Two-Tail Runs 


Solution 1 
Let Xn be the number of H/T strings with no two successive T. Xn = Thn + Hn to 
distinguish the strings that end in T from those that end with H. 

Obviously, Hı = Ty = 1, H, = Hn-1 + Tn-1 and T, = Hy,_,. Thus, we may 
compute 


34 
21 


55 | 89 
34 | 55 


Ay, | 1) 1 


It follows that there are X19 = Tio + Hio = 89 + 55 = 144 10-letter-long strings 
with no TT. There are 2!° = 1024 10-letter strings in all. Thus, the probability of not 


144 9 
seeing TT is = —. 
1024 64 
Solution 2 
N.N. Taleb 


Let M be the Markov chain, 


| 
OMNI RNR 
© O MIE 
= NIe o 


representing transitions between the following states: 


NIT|TT 
io 
sal eal ee 
212 
1 1 
Ha eee ae 
2 2 
TT | 0 1 


The matrix at the 10° step is M!° and the probabilities are 


89 
P(N)10 1024 
(1,0,0).M'"=] p(T) | = I 
P(TT)10 55 
64 
Thus, the probability of never getting two tails in 10 throws is 
55 9 


(areata 2. 
PITT )i0 64. 64 
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Artificially Unintelligent 


Solution 1 
Amit Itagi 


The number of bit flips has to be even. Let q = 1 — p: 


n = 
P= No (1) qp” k. 
{k|k=0 (mod 2), k<n} 


Using the binomial expansion, 


+p) +(q-p)” 1+(1-2p)" 
(+p) = p) +0 DE ten 
(Ci ed we AIU di 
2 2 
1+ (2p — 1)” 
Since, for odd n, (—1)” = —1, a 5 so that we can combine the two 


expressions into one: 


1+ (2p—1)" 
SS 


Solution 2 
Hélvio Vairinhos 


1 
Let a represent the correct bit and (') the flipped bit. The transition probabilities 


between two states are given by the (diagonalizable) transfer matrix: 


tee We iia a) ee (a a 
l-—p p 0 2-1 


where 


and 
1 
X = — oe: ; 
J/2\ -1 1 
For n transmissions, the matrix is 
=x! : 
0 (2p—1)" 


1 ( 1+(2p-1) 1-(2p—1)" 
yaz ; 
2\ 1-(2p-1)" 1+(2p-1) 
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The probability of measuring the correct bit after n transmissions is 
1 
(T”)u = (1+ (2-1)"). 


Solution 3 
Timon Knigge 


Using the binomial theorem (a + b)” = Se C) a”—*o*, we want only the terms with 


an even number of failures: 


P 5S (Jeta pt. a sh 


II 


Il 
NI = 
ATT 
> 
i Me 
© 
AET ON 
x z3 
St 

3 

i 

> 
po 

E 

s 
x 

> 
> ees 

+ 
Nl eR 
AET TTS 
> 
iM 
© 
AAT 
xr 3 
SS 

s 

i 

> 
os 

3S 

| 

= 
x 

> 
Se at 


Solution 4 
N.N. Taleb 


We need either 0 or an even number of switches for transmission to be correct. Let ¢ = 
n 

»( ) (1— p)n — x be the probability distribution function of a binomial distribution 
x 


N(n,p) with n trials and the probability of success p, for x successes. We need x to be 
even: © = 0,2,...,n. 
The probability under consideration is 


n/2 
P(n) = Sov ( 2 a — yp = 5+ 2-0") 
x=0 
ee 
af 


for large values of n. Since 0 < p < 1, 


1 1 
lim = (1+ (2p—1)") ==. 
pe aa cee a 


Solution 5 
Alejandro Rodriguez 


Let P(n,1) be the probability that the message gets transmitted correctly given that 


it is correct and it has to be transmitted n more times. Let P(n,0) be the probability 
that the message gets transmitted correctly given that it is incorrect and it has to be 
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transmitted n more times. Recursively, we can compute these probabilities as 


P(n,1) = pP(n — 1,1) + (1 — p)P(n — 1,0) 
P(n,0) = pP(n — 1,0) + (1 — p)P(n — 1,1) 


with boundary conditions 
P(0,1)=1 
P(0,0) = 0. 


From the previous equations, we can show that 


P(n,1) + P(n,0) = P(n — 1,1) + P(n — 1,0), 


which implies that 
P(n,0) = 1— P(n, 1). 


Substitute into the first equation and solve to get 


P(n,1) = (2Qp—-1)P(n—-1,1)+1-p 


P(n,1) = (2p— 1)" + 1-9) Y (2p - 
i=0 

Pn) = (p= 1)" + (= p e 

P(n,1) = C y 


Notes 


1 
As n —> œ, the probability tends to 5 regardless of p. 


Two Sixes in a Row 


Solution 1 
Let P(n) denote the probability that, for the first time, two successive sixes showed 


up on the tosses n — 1 and n. Obviously, P(1) = 0 and P(2) = p?, with p = 7 

Let E(n) be the expected number of tosses. There is no doubt that E(n) > 2. 
We denote the toss that came up with 6 on top as S; all other tosses are denoted 
collectively as F. Any sequence of S and F starts with one of the four combinations: 
SS, SF, FS, FF. With the probability of p? of the first two successes, we achieve 
the goal in just two throws. In case of either SF or FF, we have to add two to the 
number of tosses and start counting again. SF has the probability of p(1 — p); FF 
has the probability (1 — p)?. Concerning the first two tosses being F.S, which has the 
probability of (1 — p)p, the third toss may have two outcomes. If it is S, we achieve 
our goal in three tosses—this with the probability of (1 — p)p?. If it is F, we count 
three steps and continue as from the beginning. This happens with the probability of 
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(1 — p)?p. Thus, we are led to the recurrence relation 


i(n) = 2+ p? 
(E(n) + 2)(p(1 — p) + (1 — p)’) 
+3-(1—p)p? 
(E(n) + 3)(1 — p)’p 


A few algebra steps yield 


_2+p-p  (2-p(p+1)_ 1 1 


— p(2—p)  P(2-p) pP p 


a(n) 


1 
For p= z i(n) = 36 + 6 = 42. 


1 
In passing, for a fair coin, p = 55 that E(n) =4+2=6. 


Solution 2 
Christopher D. Long 


The expected number of throws E(t) until a 6 is on top is six, so 


i = E(tE(t) + t) = E(t)? + E(t) = 42. 


Solution 3 
Alejandro Rodriguez 


Using the Markov chain, we need three nodes: S stands for any sequence of throws 
that does not end with a 6; B is an intermediate node that denotes sequences that end 
with a single 6; V is the final node for the sequences that end in two 6s. 


5/6 


R 
Dur gui 
We derive two equations (borrowing the notations from the associated nodes). The 
equation 
1 5 
V=1+-B+-V 
6 6 


tells us that moving from a node via an attached arrow takes one throw of a die (+1) 


1 
in the diagram. Then with the probability of g i.e., of getting one 6, we move to a 


5 
new state B, while with the probability of 6 we return to where we started (but still 
wasting one toss). 


The other equation is 


5 
B=1+-V 
ta 


which tells us that with one toss we either get to the goal V or back to the start S. 
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We can find V from the two equations: V = 42, which conventionally counts the 


last throw. 


Solution 4 
Issam Eddine 


Let us call T(n), for n € N, a time at which n successive 6s show up for the first time. 
For n € N, T(n) is a random variable. We want to find E(T (2) 
on page|194) that E(T(1)) = 6 and that the game is a Markov process. For n > 1, we 
have the following relation: 


1 


(T(n+1))==(E 


6 


We then deduce that E 


Solution 5 
Michael Wiener 


\(T(n)) +1+E(T( 


S(T (n)) + 1). Thus, 


). We know (Riddle 


n+1))). 


I(T (2)) = 42. 


Using Nielsen’s result [73], we have an L-letter alphabet, with L = 6, and a target string 
S of n letters. Let S’ be the longest common prefix and suffix of S (excluding the case 
of S’ = S). Then E(S) = L” + E(S’). Thus, 


Solution 6 
N.N. Taleb 


II 
oO Olona! ou 
Go Ole 
H Ol oO 


where 


A — 5 s a 


B = (7v5 - 15) (5 
C= (15 + 7v5) (5 


(first 66) = 6? + 


(first 6) = 36+6 = 42. 


gran-tg-m-1 ( (5 + 3v5)"*? — (8 - 3v5)"*') 


v5 


3-2n-ta-n ( (15 + 9v5)” — (15 — 9V5)") 


v5 


RA ae B _ C) 


- 3v5), 
+3v5) 
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We take the third row and get y(t), the probability distribution function of the 
stopping time with the difference of the third rows of M’, and —M/_;: 


v(t) agaist ((3v5 a 5) £ (5 = 3v3)') 


= PEN ( z (7v5 + 15) n (3v5 y 5) 
— 6 (47v5 + 105) (3v5 + 5)" + (7v5 - 15) n(5- 3v5) 


46 (47v5 = 105) (5 = 3v5) +35 Da 


lim E(n) = 42. 


n— o0 


Notes 
N.N. Taleb 


The number of steps to k sixes in a row is 


(n, k) = 


1 
with p = 5 


Matching Socks in Dark Room 


Solution 1 


Let f(n) be the number of pairings of n pairs of socks into acceptable pairs. For n < 2, 
any pairing is acceptable. Furthermore, f(1) = 1; f(2) = 3. 

In general, let us start with a sock of the darkest shade. If we happen to pick a 
sock of the same shade, we are left with computing f(n — 1). Otherwise, there is a pair 
of socks of the next shade; we can choose either of the two and match the other one 
with the remaining sock of the darkest shade. This gives a recurrence: 


f(n) = f(n— 1) + 2f (n — 2). 
The characteristic equation of the latter is zf — x — 2 = 0, with roots x = 2 and 
x = —1. The general solution of the recurrence depends on two constants: f(n) = 
c12” + c2(—1)", which are found from the initial conditions f(1) = 1 and f(2) = 3. 
Thus, we arrive at 


2 


gn+l ai =j: n 
T E 
204 1 
For 10 pairs of socks, there are f(10) = we = 683 possible matchings. 


To find the sought probability, we have to find the total number of pairings, absent 
any restrictions. 
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One sock can be matched with any of 19, but when that pair is picked, the next 
sock may match with any of the remaining 17, and so on. Thus, the total number of 
pairings is 


19!!=19-17-15----- 5-3-1. 


The probability we seek equals =~ ~ 10°. 


Solution 2 


f(n) could have been found via generating functions. Let us set 
F(x) = 0 f(n)a” 
n=l 


=14+32+ 5° f(n-1) +255 f(n- 2)” 
n=3 n—3 


= 1 + 3x +x (F(x) — x) + 2r’ F(x), 


from which 


x + 2r? x 4 1 
F = = 
Oa ae e =) 


= (: > 22)" - Sar) 


n=0 n=0 
= T = n+2 n n 
=— >, 2 —(-1)")z 
n=0 
a 3 gn+2 = (-1)"" ntl 
n=0 3 


n=1 


By definition, f(n) = [a”|F (x), the coefficient of the series by the term x”. Thus, 
as before, 


7 gn+l + (—1)” 
f(n) =. 


Number of Wire Loops 
Amit Itagi 


2 
Let the expected number of loops for n wires be f(n). There are ( 4 = 2n? —n ways 


of choosing two ends from n wires. Out of these, n cases will result in a wire forming 
a self loop with n — 1 wire segments remaining. The other 2n? — 2n cases will result in 
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no self loop with just n — 1 wire segments remaining. Thus, 


[2n? — n] f(n) =n[1 + f(n —1)] + [2n? — 2n] f(n — 1) 


= fin) = fn- 


f(1) =1. Thus, 
n 2n n 
1 1 1 1 H, 
WO a a oa ee 
k=1 k=1 k=1 
Thus, 


H 
f (2018) = H4036 — oe ~ 4.787, 


where H,, is the nt® harmonic number: 
1 


1 1 
H,=14+i+i4e-42. 
Poa ht 


Determinants in Zz 


Solution 1 

Observe that the total number of n x n matrices with elements in Zə equals gn 

The number of nonzero n x n determinants over Zə equals the number of n linearly 

independent (and ordered) rows of n elements from Z2, i.e., vectors in Z}. Note also 

that ifa1,...,a%, k < n, are linearly independent in Z3, then there are exactly Z5 linear 
k 

combinations 5 Aia; € Z3. Thus, there are exactly 2” —2* ways to expand the system 
i=1 

of k linearly independent vectors a1,...,a, to a system of k + 1 linearly independent 

vectors @1,...,@k,@k41- It follows that the number of nonzero determinants over Zə 

of order n is 


(2” — 1)(2" — 2)(2" — 27)... (2% 97-1) = 2" T] (1 = x) 
k=1 


= 1 
From here, pn = II (1 — z) Thus, pn > pn+1. In addition, 
k=1 


A 1 1 Keij 1 2k+1_—1 
II ( x) 23() x)= FC 2k+2 ). 
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implying 


1 1 1 

= = 1 — — = =, 

2 ( 5) 4 
F : 1 
Thus, p = lim pp exists and p> -. 
noo 4 


Solution 2 
Long Huynh Huu 


The problem reduces to computing the number of ordered bases of V = Fy’. This can 
be done by choosing 


e a vector bı € V — span(0) 

e a vector b2 € V — span(b1) 

e a vector b3 € V — span(by, b2) 
e and so on. 


Considering that the number of n x n matrices over Fp is 2” = (2”)”, we obtain 
n—1 


ne “itis 2) -JJa -2 ailen) 


as the probability of the random determinant to be nonzero. 
The probability of a random determinant being 0 is, therefore, 


Gambling in a Company 


The expected number of rounds is 5 aiaj. The probability that the it? gambler ends 
i<j 
up with all the money is a . For n = 3, the expected number of rounds 
a, +a2 +++ + Gn 
3410203 


a, +a2 +43 


until the first loser quits the game is 


n = 2 Analog 


The riddle is an obvious extension of the game of two players (Riddle[9.1]on page|237). 
The latter is usually modeled by the well-known one-dimensional walk where a point 
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1 
on an axis moves one step at a time—left or right—with probabilities p = q = 5° The 
two players start with the capitals of, say, u and v dollars and lose or win one dollar 


1 
from the other player with the probability of 7 On average, the game lasts uv rounds 


v 
in which the probabilities of winning are and iro respectively. 
u+v 


u+ 


Definitions 


Let u = (u1, U2, . - . , Un) denote the state of the game where the it player owns u; > 0 
dollars. We will write |u| = uj +u2 +- --+un. The quantity a = |u| remains unchanged 
during the game. 

Let N(u) be the set of all states that can be attained from u in one round. If u 
has k nonzero components, then |N(u)| = k(k — 1). Define V (u) as the set of all states 
that could be attained starting with u: 


V(u) = {v : Viv; > 0, |u] = |v], (u; = 0) > (v; = 0)}. 


Solution 1 to Question 1 


Let R(u) be the expected number of rounds, starting with u, until one of the players 


has all the money. (We are to prove that a = 5 aiaj.) There is a recurrent relation: 
I< 

R(u) is the average of all x € N(u) plus 1. We also have boundary conditions: R(u) = 0 

only if all components of u but one are zero. 


The function R(u) = 5 a;aj satisfies the recurrence and the boundary condition. 
i<j 
Indeed, for any two components u and v, the product uv appears in every term of the 
sums over the neighborhood N(u). The same holds for —1; the linear terms u and v 
cancel out. We only need to prove that no other function satisfies the recurrence, along 
with the boundary conditions. 

If there are two such solutions, then the difference at any point is the average of 
its values in the point’s neighborhood and, therefore, could not be either maximum or 
minimum (in the neighborhood) unless the function is constant. Then the function is 
constant over the whole play field V = V(x), and, being 0 on the boundary is 0 on V, 
the two solutions coincide. 


Solution 2 to Question 1 
Hélvio Vairinhos 


Let a= 5 a; be the total capital at play. Let R; be the number of rounds played by 
i=1 

the i** gambler. The remaining (n — 1) gamblers are undistinguishable from a single 

gambler with (a — aj) dollars. Hence, we only need to consider two gamblers, for which 


1 n 
E(R;) = aila — a;i). The total number of rounds is R = 5 Xo Ri; hence, 


i=l 
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Solution 1 to Question 2 


Let p;(u) be the probability that the it? player ends up with all the money, starting 
with the state u. (We have to prove that p;(u) = al” pi(u) is the average of all 
u 
p(x), for x € N(u), with the following boundary conditions: p;(u) = 0 if u; = 0 and 
pi(u) = 1 if u; =a so that all other components vanish automatically. 
As before, we simply must prove uniqueness. 


Solution 2 to Question 2 
Hélvio Vairinhos 


Consider a players with 1 dollar each. Before the first round, the probability of winning 
1 

the game is uniform among the players: —. Partition the players into n teams so that the 
a 


it? team has a; members/dollars. Then, the probability of the it? team (i.e., gambler) 
$ 7 A 1 Qi 
winning the game is p; = a; © — = —. 
a a 
Solution to Question 3 


Here the recurrence is exactly the same as for the first question, but the boundary 
condition is different: the game stops when one of the three components vanishes. 


Probability in the World Series 


Solution 1 


The probability of winning the World Series in exactly n games is 


1 1 n— 1 1 n—-1 
Pin) = 5+ za 3 =a ( 3 ), 


n—-1 
1 . i . . th 3 . “Ve: 
Because — is the probability of winning the n"? game, ~——~ is the probability of 


2n—1 
winning three games out of the previous n— 1, and there are two teams to choose from. 
1 1 
We have P(4) = =, P(5) = -, P(6) = E9 P(7) = as The expectation E is then 
8 4 16 16 
1 1 5 5 93 
Bi Be Og OP ae ae 16 ~ 58125- 


Solution 2 
Marcos Carreira 


Form a Markov chain with absorbing states {11, 15, 16, 19, 20, 22, 23, 24} as shown here: 
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Those states are attained with probabilities 


1 1115 5 5 5 
16’ 16’ 8’ 8’ 32’ 32’ 32’ 32 


in {4,4,5,5,6,6, 7,7} games, respectively. Thus, we obtain the expectation of 


1 1 5 5 30 30 35 35 
H H H H = 5.8125. 
4 4 81 8 32 32 1 32 1 32 are 


Solution 3 
Amit Itagi 


For the series to end in n games (n > 4), the winning team has to win three of the 
first n — 1 games and the n‘* game. Moreover, either one of the two teams could be 
the winning team. Thus, the expected number of games is given by 


7 n 
n—-1 1 93 
E = SDa L ae 
yee ("S')G) =i 
= 5.8125. 


Probability of a Meet in an Elimination Tournament 


Solution 1 
Let us consider a more general problem of 2} players. We shall denote the two players 
Aand B. 

If k = 1, the two players necessarily play each other: P, = 1. 

If k = 2, there are three possible pairings, implying that A and B meet in the first 


1 
round with the probability of 3" If not, each wins his first round match with probability 
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1 ONO. 
3 such that the probability of their meeting in the second round is 3’ (5) = 


1 1 1 
Thus, Py = =~ + == =. 
ae D 
In general, with 2” players, A and B meet in the first round with the probability 
1 2k —2 
of KLT With the probability of KOT they do not meet in the first round but 


2 
1 1 
continue to the second with the probability of G) = For the total probability 


Pk, we get a recurrence: 


2(2* — 1) Pe = 2 + (27t —1)- Py. 
After multiplying both sides by 2*~1, we get 

oF (2* — 1) Pp = 2F oR (ght — 1) + Pena, 
This recurrence is easily solved: 

2F (2 — 1) Pp = 25 OF AOR — 1) - P1 


= 9k afi 9k— aye Vane aa = 1) Pp-2 


— 9k 9k 9k 2 4 Qk 02" 3 1)Py_3 


2 1 k-1 
=X Ż+2(2-1)P => 2% =29_ 2 = 2(2% — 1). 
i=k i=k i=0 


1 
Thus, 2*(2* 1) P, = 2(2* — 1) so that P, = For k = 5, the probability is T 


2k—1' 


Solution 2 
Xavier Faure 


With 2" particpants 2% — 1 games will be played out. On the other hand, the total 


2k(Q" — i 
amount of possible games equals et) = 2*-1(2* — 1). Thus, for each game to 
2k —1 1 


be played in the tournament, the probability equals Py, = IIF) = T 


Solution 3 
Amit Itagi 
Suppose there are N players. When two players meet in round k, they are effectively 


the chosen two from a pool of 2% players who competed in the sub-bracket leading to 
that particular match. Let us call such a sub-bracket a k-sub-bracket. 
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2N 
There are Mk = JE k-sub-brackets. The probability that both players end up in a 
particular k-sub-bracket is 


Qe 2—1 


Pi(k) = ow ony: 


The probability that two players from a k-sub-bracket meet in round k is 


1 1 1 1 


Pa(k) = 2-96 ey = eT RIT 


Thus, the probability that the two players play each other is 


N 
Py = Ss Mk < Py(k) - Po(k) 
k=1 
ee ee eee ae 1 1 
2k QN 2N —1 Qk-1 Qk — 1 
k=1 
t +, 1 
oN 7 2 9k—-1  9N-1 
k=l 
For N =5, 
1 
Bo =: 
a I6 
Rolling a Die 


Solution to Question 1 
Problem 20], N.J. Fine 


There are six possible outcomes before the running total exceeds 12: 12, 11, 10, 9, 8, 
7. Here is the list of possible outcomes for any of these: 


12: 13 14 15 16 17 18 
11: 12 13 14 15 16 17 
10: 11 12 13 14 15 16 
9: 10 11 12 13 14 15 
8: 9 10 11 12 13 14 
T: 8 9 10 11 12 13 


The most likely total appears to be 13 as it is the only one with a chance to be 
reached in all six cases. 


Solution to Question 2 
Josh Jordan 


The exact answer for question 2 can be found by setting up a Markov chain and 
solving the resulting set of linear equations. Let x, a,b, c,...,l be the expected sum if 
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the running total is 0,1,2,3,..., 12: 


6r =a+b+c+dt+e+f 


6a=b+c+dt+et+frt+g 


6b=c+d+et+ft+gt+h 
6c=d+e+ftgth+ti 


6d=e+f+g+h+i+j 


6e=f+g+h+i+j+k 


6f=g+h+i+j+k+l 


6g=h+i+j+k+l+13 
6h=i+j+k+l+27 


6i=j+k+l+42 


6j =k+14+58 
6k =1+75 
6l = 93. 
Then 
63954819943 
~ 4353564672 re 


Solution to Question 2, The Long Route (Unrefined but Rigorous Solution) 
N.N. Taleb 


The most likely total is trivial (we leave the solution for later, in the implementation 
section). The expected total is more involved. 

Let 0 = {1,2,...,6}. Let X = {y1,x2,--.} be the set of D-tuples TP, ordered 
multisets generated from 0. If we pick the exhaustive solution of D = 13, the multiset 
total number of members is d = 613. We have Xm = (Xm(1),Xm(2),---,Xm(18)), say 
xı = (1,1,...,1), X2 = (1,1,...,2), etc. Each tuple x; has a probability of occurrence 


1 
of J For each tuple, the corresponding stopping time condition is 1,<,, with 7 the 


first time the preceding sum exceeds 13: 


T(m) = fir : > Xml) > | : 
j oa j 


Let X, be the cumulative tally on the die at stopping time 7. Therefore, we have 


1 D 
(2+) = 3 D. max(xn) 
m=1 
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and the average time to stopping is 


1 d 
(r) =5 X tm). 


Implementation and Refinement 


The previous method, while simple to implement, is computationally heavy. We know 
that the gain in stopping time from D = 12 to D = 13 is 67! (a single tuple has a 
stopping time of 13), and the gain from D = 11 to D = 12 is 671? (only six additional 
tuples have a stopping time of 12). We can safely reduce D by opting for a length 


D 
of integers = a instead of D for the tuples and test for convergence. Thus, we can 


compute the bias: 


Expected Total 


D=5 Expected T 
Quantile 
Expected Total 
D=6 Expected T 
Quantile 
Expected Total 
D=7 Expected 7 
Quantile 
Expected Total 
D=8 Expected T 
Quantile 
Fair Duel 


14.6447 Total Tuples 
3.97605 Stopped Tuples 
14 Precision: 
14.68 Total Tuples 
4.13774 Stopped Tuples 
14 Precision: 
14.6886 Total Tuples 
4.1861 Stopped Tuples 
14 Precision: 
14.69 Total Tuples 
4.19577 Stopped Tuples 
14 Precision: 


7776 
7014 
0.902006 


46656 
45738 
0.980324 


279936 
279144 
0.997171 
1679616 | 


1679121 
0.999705 


Generalizing a little, assume Smith hits Brown with probability q and Brown hits 
Smith with probability p. There is a three state Markov chain: S(mith) shoots, B(rown) 


shoots, the duel End(s). 


If P is the probability of Smith winning the duel (and we know that, the duel being 


1 
fair, P = 5) then 


P=qg+(l—g@)(l=p)P, 
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a recurrence which says that Smith wins on the first shot or gets an equal chance after 
he and Brown both miss. 
This leads to P(¢+p— qp) = q, or (¢+p— qp) = 2q, i.e., p= T In the specific 
=q 
4 2 


4 
case where q = 75, P= 3 = 3" 


We may represent the chain with a transition matrix, e.g., with three states S, B, 
End (in that order): 


0 1l-q q 
M=|1-p 0 p 
0 0 1 


The last row simply means that, once we get to state End, this is where we stay 
with the probability of 1: 


(1-q4)(1- p) 0 q 
M? = 0 (l—g)(l-p) p 
0 0 1 


This tells us that, after two attempts, B is back in B with the probability (1 — 
q)(1 — p). The same is true of the state S. 

The state of affairs after two steps is read from the first row and the recurrence is 
obtained by multiplying vector (1 0 0) by M?: 


(qi) 0 q 
(1 0 0) 0 (l—qg)(l-p) p| =(1-q—p)0q), 
0 0 1 


and, as a consequence, P = q + (1 — p)(1 — q)P. 


Solution 2 
Amit Itagi 


The problem can be described with a decision tree: 


Equivalently, 
P(S win) = 0.4- [1 + 0.6 - (1 — p) + 0.6- (1 —- p)? +...] 
P(B win) = (1 — 0.4)- p- [1+ 0.6(1 — p) + 0.6? - (1 — p)? +...] 


= 04 2 
1-04 3 


P(S win) = P(B win) >p 


Chapter 10 


Sampling of American Mathematics 
Competition Problems 


Probability is a liberal art; it is a child of skepticism, not a tool for people 
with calculators on their belts to satisfy their desire to produce fancy cal- 
culations and certainties. 


Nassim Nicholas Taleb, 2007, The Black Swan 


Riddles 


10.1 Five-Digit Numbers Divisible by Eleven 
AHSME 1970, |80| Problem 31] 


If a number is selected at random from the set of all five-digit numbers in which 
the sum of the digits is equal to 43, what is the probability that the number will be 
divisible by 11? 


10.2 The Odds for Two Teams 
AHSME 1971, Problem 23] 


Teams A and B are playing a series of games. If the odds for either team to win any 
game are even and Team A must win two or Team B three games to win the series, 
then what are the odds for the teams to win the series? 


10.3 Cutting a String into Unequal Pieces 
AHSME 1972, |80| Problem 17] 


A piece of string is cut in two at a point selected at random. What is the probability 
that the longer piece is at least x times as large as the shorter piece (x < 1)? 


10.4 Getting an Arithmetic Progression with Three Dice 
AHSME 1977, [6| Problem 17] 


Three fair dice are tossed (all faces have the same probability of coming up). What 
is the probability that the three numbers turned up can be rearranged to form an 
arithmetic progression with common difference one? 


10.5 Probability of No Distinct Positive Roots 
AHSME 1979, [6| Problem 27] 


An ordered pair (b,c) of integers, each of which has absolute value less than or equal 
to five, is chosen at random, with each such ordered pair having an equal likelihood of 
being chosen. 

What is the probability that the equation x? + bz + c = 0 will not have distinct 
positive roots? 
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10.6 Rolling a Defective Die 
2005 AMC 12A, Problem 14] 


On a standard die, one of the dots is removed at random with each dot equally likely 
to be chosen. The die is then rolled. 
What is the probability that the top face has an odd number of dots? 


10.7 Counting Coins 
AHSME 1980, [6| Problem 20] 


A box contains 2 pennies, 4 nickels and 6 dimes. Six coins are drawn without replace- 
ment, with each coin having an equal probability of being chosen. What is the proba- 
bility that the value of the coins drawn is at least 50 cents? 


10.8 The Roads We Take 
AHSME 1982, [6| Problem 25] 


The figure below is a map of part of a city: the small rectangles are blocks and the 
spaces in between are streets. Each morning a student walks from intersection A to 
intersection B, always walking along the streets shown, always going east or south. 


1 
For variety, at each intersection where he has a choice, he chooses with probability 5 


(independent of all other choices) whether to go east or south. 
L l 

| | j: 
| 


Find the probability that, on any given morning, he walks through intersection C. 


E5 
IPO AE 


10.9 Marbles of Four Colors 
AHSME 1996, Problem 26] 


An urn contains marbles of four colors: red, white, blue and green. When four marbles 
are drawn without replacement, the following events are equally likely: 


1. the selection of four red marbles; 

2. the selection of one white and three red marbles; 

3. the selection of one white, one blue and two red marbles; 
4. the selection of one marble of each color. 


What is the smallest number of marbles satisfying the given conditions? 
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10.10 Two Modified Dice 
AHSME 1997, Problem 10] 


Two six-sided dice are fair in the sense that each face is equally likely to turn up. 
However, one of the dice has the 4 replaced by 3 and the other die has the 3 replaced 
by 4. When these dice are rolled, what is the probability that the sum is an odd 
number? 


10.11 Exchanging Balls of Random Colors 
AMC 2006, Problem 17B] 


Bob and Alice each have a bag that contains one ball of each of the colors blue, green, 
orange, red and violet. Alice randomly selects one ball from her bag and puts it into 
Bob’s bag. Bob then randomly selects one ball from his bag and puts it into Alice’s 
bag. What is the probability that after this process the contents of the two bags are 
the same? 
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Solutions 


Five-Digit Numbers Divisible by Eleven 


The maximum sum of five decimal digits is 45. To obtain 43, we should allow either 
one 7 or two 8s. There are five integers of the first kind, 79999, 97999, 99799, 99979, 
99997, and 10 integers of the second kind, 88999, 89899, 89989, 89998, 98899, 98989, 
98998, 99889, 99898, 99988. 

Numbers are divisible by 11 if their alternating sums of digits are divisible by 11. 
There are two numbers of the first kind that satisfy this criterion: 97999 and 99979. 
Indeed, 


9-74+9-94+9=11, 
9-94+9-74+9=11. 


Of the second kind, only one number, 98989, is divisible by 11. Indeed, 


9-8+9-8+9=11. 


Out of the total of 15 numbers, three are divisible by 11. The probability of this 
1 


. 3 
event is — = =. 


15 5 


The Odds for Two Teams 


The series would last at most four games before Team A either wins two games or 
loses (to Team B) three games. Team A wins the series in two games with probability 


There are two ways for it to win in three games: LWW or WLW. Each occurs 
with probability : so that the probability that Team A wins in exactly three games is 
also x Team A wins in exactly four games if one of the following events takes place: 
WLLW, LWLW, LLWW. Each of these happens with probability a for a total of A 


1 3 11 


1 
The total probability to win the series for Team A is, therefore, Z + i + 16 ae 


Team B wins with probability en The odds are 11 to 5. 


Cutting a String into Unequal Pieces 


Think of the string as a segment AB of length 1 + x. Let P be a point inside AB such 
that AP : PB = z. To satisfy the condition of the riddle, the random point should fall 


1 
to the left of P. This happens with probability ee This is the probability that the 
x 


longer piece is at least x times as long as the shorter piece, provided the longer piece 
is to the right of the shorter one. A similar reasoning applies to the case where the 
longer piece is to the left of the shorter one. This happens with the same probability, 


. The answer is, therefore, ; 
x+1 z+1 
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Getting an Arithmetic Progression with Three Dice 


There are four possible arithmetic progressions with common difference one that could 
be formed by the numbers 1 to 6: {1,2,3}, {2,3,4}, {3,4,5}, {4,5,6}. Of 6° possible 
outcomes, each of these occurs in 3! = 6 cases, making the total probability 


Probability of No Distinct Positive Roots 


—b+ Vb? — 4c 
5 . 
Now, it may be easier to determine when the equation does have distinct positive 
roots. This happens when 


The basis for the solution is naturally the quadratic formula: 71,2 = 


1. c>0, 
2. b < 0 and 
3. b? —4ce>0. 


With b on the horizontal axis and c on the vertical one, only the blue points satisfy 
the three conditions: 


There are 10 such points and 121 = 11? ordered pairs that satisfy |b|, c < 5, making 


1 
~ The probability of not having 


the probability of distinct positive roots equal to I2 


111 


0 
istinct iti ts is then 1 — —— = —. 
distinct positive roots is then 21 1 
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Rolling a Defective Die 


Solution 1 
Thamizh Kudimagan 


Sum over all faces: 
P(face up) x [P(being odd/no change) x P(no change) 
+ P(being odd/change) x P(change)] 


20 1 19 2 18 3 
= } | 1- 1 
al 21 i 21 " 2 2 at? x) 
1 17 4 16 5 15 6 
+z (0 att att a +9 ait ag tt =) 
ill 
21° 
Solution 2 


A. Bogomolny, Amit Itagi, Christopher D. Long, Aaron Haspel 
This is a shortcut from solution 1. 

Originally, there are 1+2+3+4+45+4+6 = 21 dots of which 1+3+5=9 are on 
the odd numbered faces and 2 + 4+ 6 are on the even numbered faces. After a dot is 
removed, there are either two even faces and four odd faces or four even faces and two 


12 
odd faces. The probability of the first event is a1 the probability of the second one is 


9 4 2 
J In the first case, the probability of having an odd face on top is z it is 6 for the 


second case. Since all the variants are mutually exclusive, the total probability is 
4 12 2 9 2.-12+9 1l 
6 2 6 2 3-21 2 


Counting Coins 


Solution 1 


[84] p. 143] 


There are 12 coins and (3) = 924 equiprobable ways to draw six of them. There are 
only two cases where the value of the drawn coins exceeds 50 cents: 

1. 6 dimes 

2. 5 dimes plus any other coin but a dime 


3. 4 dimes plus 2 nickels. 


The number of ways the cases may occur are 1, (5) (D = 36 and (‘) E = 90, 


14+36+90 127 
924 ~ 924° 


making the sought probability equal to 
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The Roads We Take 


Solution 1 


[6] p. 174] 


w 


O hi bid k 
N ʻe o 


E aa a aaa G aa S aa G a 


JO 


The probability that the student passes through C is the sum of the probabilities 
that he passes through intersections C;, i = 0, 1,2,3, and, at each, continues eastward. 


The number of ways of getting from A to C; equals C A ‘ because each such path 
has two horizontal streets which may come in any order. The probability of taking 
each of the passes is (5) 7 due to the available number of choices on the way there. 
Thus, the total probability comes to 


ye 1 1 8 8 2 
^ 2 2 8 16 32° 64 32 


Notes 


Just counting the number of ways from A to B and from A to C would not work. 


The numbers are (5) = 20 and o = 35, allegedly giving the sought probability 


as 35 7 The reason that this does not work is that different pathways come with 


different probabilities such that simple comparison of their numbers is misleading. 
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Solution 2 
[6] p. 174] 


It is easy to construct a tree diagram to count probabilities of getting at every possible 
intersection: 


1 > į > i >o | 
+ + 4 + 
ie ee A 
4 4 4 + 
17 8 7 FFG 
4 4 4 + 
ee ee oh 
4 4 4 4 
Te, SP ae oP ag Se 


Summing up the probabilities, we observe that for the nodes on the right edge going 
southward is the only chance. Similarly, bottom nodes also offer only one option. At all 
other nodes the chances are evenly split between the neighbors on the right and below. 


Solution 3 
Christopher D. Long 


The first location for the walker to reach the outside of the 3 x 3 subsquare must be 
one of the six numbered spots: 


1 
By symmetry, the probabilities on each side must sum to z Spots 1, 2 and 3 must 


pass through C. From the other direction, the walker must pass through spot 6 to 
reach C. 
Thus, the total probability is 


ue 1\5 1 2 
2 2 2 2 32° 
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Solution 4 

Josh Jordan 

Solve using Markov chains. Label each intersection with the probability that a path 
through it will pass through C before reaching B. Each intersection’s probability is 
the average of its south and east neighbors. Working backwards from B, we find the 


21 
probability for A is 30° 


Solution 5 

Amit Itagi 

Start with three paths from A, dividing paths into equals at each step and conserving 

the number of paths at each vertex. 21 of the 32 paths go through C. The answer 
21 


Solution 6 

Daren Chapin 

A key observation is that Pascal’s triangle—the number of ways to reach the 
intersection—continues beyond the eastern edge; everything that crosses that edge 
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20+15+6+1 2 
26 3 


1 
also gets to C. Therefore, a 0.65625. 


Solution 7 
N.N. Taleb 


We introduce the following node (intersection) enumeration: 


1 5 9 13 17 
2 6 10 14 18 
3 7 11 15 19 
4 8 12 16 20 


with the connectivity graph, 


so that the role of C is taken by node number 16. Here are the probabilities of passing 
through each of the 20 nodes: 


Table[{z, Table[PDF[|proc{i], z], {i, 0, 7}]//Total}, {z, 1, 20} 

(oun fo} 02} {03} {58} fod} fo} 
baket eieh find} fad) 
fis, a} , fis, ati ats a} fis, a} , {20, yf 
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Marbles of Four Colors 


Let r, w, b, g be the numbers of red, white, blue and green marbles, respectively. Denote 
the total number of marbles as n = r + w +b + g. The sample space of all possible 


n 
four marble selections consists of ( l elementary events. We just count the number 


of elementary events that combine into the four events described in the problem: 


QOG 
OOO 


The four quantities are equal: 


S Ea a 


The equalities give successively 


Ëu, rot b sad mi- 

As a reminder, we are looking for integer solutions to the three equations, implying 
r = 3 (mod4), r = 2 (mod 3), r = 1 (mod 2). The Chinese Remainder Theorem 
tells us how to find r that satisfy all three conditions. Finding the smallest such r could 
also be accomplished by direct inspection: r = 11. From here, w = 2, b = 3 and g= 5 
so that n = 11 +2 +3 +5 = 21, for the smallest possible r, implying that n is the 
smallest possible total. 

All other possible values of r are given by r = 11 + 12k, where k > 0 is an integer. 


g. 


Two Modified Dice 


An odd sum may result in one of the two dice showing an even number, the other an 
odd number and vice versa. There are four even faces and two odd faces on one die 
and two even and four odd on the other. 

Thus, the probabilities come out to be 


44 22 22 5 
66 66 36 9 


Exchanging Balls of Random Colors 


Regardless of which color ball Alice placed into Bob’s bag, there are now six balls of 
which two are of the same color in Bob’s bag. Only if Bob picks one of these and puts 
it into Alice’s bag will the two bags have the same contents. This happens with the 


2 1 
bability = = +. 
probability z 3 


Chapter 11 


Elementary Statistics 


Statistical thinking will one day be as necessary for efficient citizenship as 
the ability to read and write! 


Samuel S. Wilks, 1951, Presidential Address to the American 
Statistical Association 


Riddles 


11.1 A Question about Median 
Problem 65312| 


One number from a given set of 100 numbers was removed, and it was found that 
the median of the remaining 99 was 74. The number was put back and another was 
removed. The median became 68. 

What is the median of the whole set of 100 numbers? 


11.2 Family Statistics 
p. 45] 


Do men have more sisters than women? 


11.3 How to Ask an Embarrassing Question 


Section 7.6], pp. 15-16] 


How could one conduct a poll with an absolute guarantee of anonymity? 


11.4 An Integer Sequence with Given Statistical Parameters 


Problem 1.1.5] 


Find an integer sequence of seven integers that satisfies the following conditions 
simultaneously: 


Mean: 10 
Median: 9 
Mode: 7 
Range: 15. 


In addition, assume that both 9 and 10 are terms of the sequence. 


11.5 Mean, Mode, Range and Median 
Problem 46] 
A list of five positive integers has mean 12 and range 18. The mode and the median 


are both 8. 
How many different values are possible for the second largest element of the list? 
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Solutions 


A Question about Median 


Solution 1 
Place the numbers in increasing order, say, aj < dg < +--+ < a100. The median of the 
a50 + 451 
a ae 
If a number, say a; with j < 50, was removed, then a5; becomes the median; with 
j > 50, a59 becomes the median. It follows that asọ = 68 and a5; = 74. 
68 + 74 


set is 


The median of all 100 numbers is then = 71. 


Solution 2 
N.N. Taleb 


Let S = {a1, a2, . . . , a50, 451,- : - , Q100}, arranged by rank. Let M(T) be the median of 
set T. Note that the removal of any term above the median has the same effect; the 
removal of any term below the median also has the same effect. It follows that in the 
two experiments the items have been removed from different halves of the set S: 


M(S \ {a>50}) = aso = 68, 
M(S \ {a<s0}) = a51 = 74. 


The median M(S) is therefore 71. 


Family Statistics 


An answer may be surmised from a few examples. Take a small family with two siblings: 
a boy and a girl. The boy has a sister, while the girl does not. In a family with a son 
and two daughters, the boy has two sisters, while the girls have only one each. A 
rule seems to emerge: a girl is excluded from sister counting, while boys count all the 
female siblings there are. From this, the conclusion that men should have more sisters 
than women seems to follow naturally. However, this conclusion is wrong. Men have as 
many sisters as do women. The arguments below seem sufficiently convincing to seal 
the result. 


Solution 1 
A. Bogomolny 


In a family with s girls and n—s boys, the boys have a total of s(n— s) sisters, while the 
girls have s(s — 1) sisters. Assume that boys and girls come into the world with equal 


1 
probabilities of — and that the birth events are independent. Then there are 2” ways a 


family with n children might have come about. Of this, C) —the binomial coefficient 
s 


“n choose s”’—is the number of n-children families with s daughters. Therefore, the 
average number of sisters boys from n-children families have is given by 


Bn = > C) . s(n — 8). 


s=0 
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Similarly, the average number of girls’ sisters in such families is 


Gn = >> C) -s(s— 1). 


s=0 


Both sums are easily computed with generating functions |16|. Let B,(x,y) = 
(a+ y)"2-" and G,(x) = (1+ 2)"2-”. Then 


32 
n= geg (b D. 
while 
2 
Gn = ao 
which immediately implies Bn = Gn = MEEN, 


Solution 2 


The following questions are equivalent: 
1. Do men have more sisters than women? 
2. Do men have more sisters than brothers? 
3. Does a random child have as many sisters as brothers? 


4. Does an average family have as many daughters as sons? 


Yes is an obvious answer to the last question. 


Solution 3 
In a letter from many years ago, a correspondent described a reaction of a friend of 
his in these words: 


For any one person (gender unspecified), the gender distribution of their 
siblings is unaffected by their own gender. There are the same number of 
men and women to be that one person. Therefore men and women have 
the same number of sisters. 


How to Ask an Embarrassing Question 


Let us start with an abstract of the groundbreaking paper by Stanley L. Warner [96]: 


A survey technique for improving the reliability of responses to sensitive 
interview questions is described. The technique permits the respondent to 
answer “yes” or “no” to a question without the interviewer knowing what 
information is being conveyed by the respondent. The privacy of the inter- 
viewee is protected by randomizing his response. For example if all members 
of a population belong either to group A or to group B and the investigator 
wants to determine the proportion of group A individuals in the popula- 
tion, this information can be elicited by using the following procedures. 
Before each interview, the respondent is provided with a spinner marked 
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with a point A and a point B. The spinner is marked off in such a way that 
the spinner’s marker will stop at point A with a probability of p and at 
point B with a probability of 1 — p. When the interviewer asks the sensitive 
question concerning group membership, the respondent spins the spinner 
out of the sight of the interviewer. The marker will either stop at point A 
or point B. The respondent then indicates whether or not he belongs to the 
group to which the marker is pointing. The respondent does not tell the 
interviewer where the marker is pointing. Assuming that these responses 
are truthful, it is then possible to determine maximum likelihood estimates 
of the true proportion of A’s in the population. The formula for calculating 
these estimates is provided. 


Let N be the population of respondents, P the number of those that belong to 
group A and M the number of yes answers received. The task is to estimate P. 


Solution 1 


M is comprised of truthful answers from a proportion of P and truthful answers from 
a proportion of (N — P): 
M =pP+(1—p)(N — P) = P(Qp-—1)+ N(1-p) 


M~—N(1-p) 


from which P = 
2p— 1 


1 
. Obviously, p = 3 should be avoided. 


Solution 2 


Section 7.6] 


Instruct each respondent to toss two fair coins secretly. If the coins both land heads 
up, then that person reports the opposite of the truthful answer. Anyone who flips at 
least one tail answers truthfully. 

As there are four equiprobable outcomes for a toss of two coins, one fourth of the 
population will lie, while three fourths will answer truthfully. Assuming that members 
of group A are in the same proportion as the general population, we obtain the following 
equation: 

3 


(N-—P)+=P 


M= 
A 


Ble 


1 
from which P = 2 (m — IN) 


Solution 3 
pp. 15-16] 


Instruct each respondent to toss a fair coin (out of sight of the interviewer). If the coin 
shows tails, ask the person to truthfully answer whether he/she belongs to group A. If 
the coin shows heads, instruct the person to record the result of the second toss, with 
“yes” for, say, heads and “no” for tails. Ask the person to report the final result: 

1 1 


M=-P+-N 
2 t7 


1 
from which we get the same formula as in solution 2, P = 2 (a1 = 1^ J 


Chapter 11: Solutions 277 


Notes 


Formulas in solutions 2 and 3 can both be obtained formally from that in solution 1, 


setting : 
ing p = —. 
4 


An Integer Sequence with Given Statistical Parameters 


The riddle, as stated, has three solutions that were found—one at a time—by just 
playing with pencil and paper, until Amit Itagi undertook a rigorous investigation. 


Solution 1 


By inspection, with a little effort we find 
4,7,7,9, 10, 14, 19. 


Solution 2 
Gökhan Karahan 


One solution is 


5,7,7,9, 10, 12, 20. 


Solution 3 
Bryan Johnson 


One solution is 


3,7,7,9, 12, 14, 18. 


Solution 4 
Amit Itagi 


Let the sequence sorted from smallest to largest values be labeled a;, i € {1,2,...,7}. 
From the median, a4 = 9, and one of as,ag,a@7 is 10. From the mode, either two or 
three of a1, a2, a3 are 7. 

Suppose all three are 7. Then from the range, a7 = 7 + 15 = 22. From the mean, 
the last undetermined number is 70 — 3 x 7 — 9 — 10 — 22 = 8. This is not possible as 
the undetermined number has to be greater than a4 = 9. 

Suppose there are two 7s in the sequence. First, suppose that 7 is the smallest 
number. Thus, a3 = 8 or a3 = 9. From the range, a7 = 7+ 15 = 22. Then, from the 
mean, the remaining two undetermined numbers (a3 and another one) have to add up 
to 70—2 x 7—9—10—22 = 15. Thus, the last number is either 15—8 = 7 or 15-9 = 6. 
Neither is greater than a4 = 9. Thus, this case is not possible. 

The remaining case is that there are two 7s and a, < 7. From the range, a7 = a,+15. 
Thus, from the mean, a1, a7 and the other undetermined number (denoted by b) add up 
to 70—2 x 7—9—10 = 37. Thus, 6+ 2a,+15 = 37 or b+ 2a, = 22. Note b < a7 =a, +15. 
Thus, 22 = b+ 2a, < 3aı +15 or a, > 2. Noting that a, < 7, the possible combinations 
of (a1, b) are (3,16), (4,14), (5,12) and (6, 10). The last combination is disallowed as 7 
is the mode. Thus, the possible sequences are: (3, 7, 7,9, 10, 16, 18), (4,7, 7,9, 10, 14, 19) 
and (5, 7,7, 9, 10, 12, 20). 
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Solution 5 
N.N. Taleb 


Let X = {x1), X(2),%(3), T(4), X(5), X(6); x7) } ordered such that 
T) S 22) S+ S Ee). 
We have x7) = z(a) + 15, £4) = 9, and we need to solve for: 
1 
7 CTW +2) +26) +26) + Te) + 24) = 10, 
with 
(AVBVC)ADAEAF, 


A =: zı = £2 =T NTs < ze < zı +15, 


=: £2 = L3 = T NTs < ze < xı +15, 


C =: zı = T2 = £3 = T ^ £5 < £6 < z1 +15, 
D =: 9< x5 < 10, 


E=: £1 < £2 < £3 < 9 < 45 < 10 < z6, 


F =: ((a5 = 9^ ze = 10) V z5 = 10). 


The acceptable sequences are therefore 


3 7 7 9 10 16 18 
4 7 7 9 10 14 19 
5 7 7 9 10 12 20 


Mean, Mode, Range and Median 


Let the numbers be a, b,c, d,e, in increasing order. First, as the mean is 12, a+b+c+ 
d+e = 60. Second, since the median is 8, c = 8. Third, since the range is 18, e—a = 18. 
Thus, we are looking at the sequence a, b,8,d,a+ 18, satisfying 2a + b+ d+ 26 = 60, 
or 2a + b + d = 34. In addition, 8 is a unique mode so that either b = 8 or d = 8 (or 
both). We come to two possibilities: 


1. 2a+ d= 26, with d> 8, or 


2. 2a+b = 26, with b < 8. 


In the first case, d > 8, implying 2a < 18, i.e., a < 9. However, for a, a < b= 8 is 
a stronger inequality. At this point, we have the following information concerning the 
five numbers: a < 8, b = c = 8, d > 8, e = a + 18. The least value a can be is 3, for, 
with a = 2, e = 19, implying D > 60 — 2 — 8 — 8 — 20 = 22 > 20. Therefore, we have 
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the following table: 


d 


abe e 

3 8 8 20 21 
4 8 8 18 22 
5 8 8 16 23 
6 8 8 14 24 
7 8 8 12 25 
8 8 8 10 26 


The second largest element may be one of 10, 12, 14, 16, 18, 20. 
In the second case, where 2a + b = 26 with a < b < 8, we have an obvious 


impossibility. 


Appendix A 


Dependent and Independent Events 


The word independent is ubiquitous in the study of probabilities in at least two 
circumstances. 


e Independent experiments: The same or different experiments may be run 
in a sequence, with the sequence of outcomes being the object of interest. For 
example, we may be interested to study patterns of heads and tails in successive 
throws of a coin. We then talk of a singular compound experiment that combines 
a sequence of constituent trials. The trials—the individual experiments—may or 
may not affect the outcomes of later trials. If they do, the experiments are called 
dependent; otherwise, they are independent. The sample space of the compound 
experiment is formed as a product of the sample spaces of the constituent trials. 


e Independent events: An event is a subset of a sample space. Events may or 
may not be independent; according to the definition, two events, A and B, are 
independent only if P(AN B) = P(A)P(B). 


It is a common practice to blur the distinction between these circumstances. When 
running independent experiments, the usage of the product formula P(A N B) = 
P(A)P(B) is justified on combinatorial grounds. For a pair of independent events, 
the formula serves as a definition. An association between the two, as discussed here, 
provides a justification for the latter. 

Consider tossing a coin three times in a row. Since each of the throws is independent 
of the other two, we consider all 8 = 2? possible outcomes as equiprobable and assign 


each the probability of - The following is the sample space of a sequence of three 
coin tosses: 
{HHH, HHT, HTH, HTT,THH,THT,TTH,TTT}. 
There are 28 possible events, but we are presently interested in, say, two: 
A={HHH,HTH,THH,TTH} and 
B={HHH,HHT,THH,THT}. 


A is the sequence of tosses in which the third one comes up heads. B is the event 
in which heads comes up on the second toss. Since each contains four outcomes out of 
the equiprobable eight, 


1 
The result may have been expected: after all, 3 is the probability of heads on a 


single toss. Are events A and B independent according to the definition? Indeed they 
are. To see that, observe that 


ANB ={HHH,THH}, 
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2 1 
the event of having heads on the second and third tosses. P(AN B) = — Further, 
let us find the conditional probability P(A|B): 


P(AN B) 


P(AIB) = Eom 


P(A|B) = P(A), and, according to the definition, events A and B are independent, 
as expected. 

This is in fact always the case. Assume we run a sequence of (independent) experi- 
ments with, among others, two possible outcomes x and y with probabilities P(x) = p 
and P(y) = q. The event that the first outcome in a sequence of experiments happens 
to be x has the probability of p because something happens on every trial with the 
probability of 1 and the combinatorial product rule applies. Similarly, the event of 
having the outcome of y on the second trial (in any sequence of experiments) has the 
probability of q. Using random variables Vı and V2 for the outcomes of the first and 
second experiments, we may express this in the following manner: 


P(V, = x) =p and 
P(V =y) =4. 
If A and B are the corresponding events, P(A) = p, P(B) = q. The event AN B 
of having x on the first experiment and y on the second has the probability of L, It 


follows that 


Pale) = “SOS = Flap 


making the events A and B independent. 

The following remark may be surprising but is worth being noted. Two events A and 
B may belong (both of them) to different sample spaces. Moreover, their dependency 
may be a function of the sample space at hand. 

For example, assume the sample space is depicted as a rectangular part of a square 
grid as shown in Figure[A.1] Event A is described by a 6 x 10 rectangle, event B by a 
5 x 5 rectangle. The two intersect at a 3 x 1 rectangle. 
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Figure A.1: Dependence depends on sample space. 


First, consider the ambient sample space of size 25 x 20. Then 


TER _ P(ANB) 3 
o PA e = oe 


We see that A and B are independent. However, if the sample space measures 
24 x 20, the situation is different: 
60 P(ANB) 3 


agg 7 P(A) # P(AIB) = PB) 35 


Appendix B 


Principle of Symmetry 


According to the Principle of Symmetry, also known as the Principle of Indifference, 
equal or uniform probabilities should be assigned to equipossible events so that equipos- 
sible becomes equiprobable [47]. We will leave it at that with a quote from Bas C. van 
Fraassen’s book, fao]: 


Since its inception in the seventeenth century, probability theory has often 
been guided by the conviction that symmetry can dictate probability. The 
conviction is expressed in such slogan formulations as that equipossibil- 
ity implies equal probability, and honoured by such terms as indifference 
and sufficient reason. As in science generally we can find here symmetry 
arguments proper that are truly a priori, as well as arguments that simply 
assume contingent symmetries, and “arguments” that reflect the thirst for 
a hidden, determining reality. The great failure of symmetry thinking was 
found here, when indifference disintegrated into paradox; and great success 
as well, sometimes real, sometimes apparent. The story is especially impor- 
tant for philosophy, since it shows the impossibility of the ideal of logical 
probability. 


Example 1 
When n points are dropped on a unit segment, the distributions of lengths of the so 
obtained n + 1 pieces are all the same. 

To see why this indeed may be so, drop n + 1 points on a circle. Then, because of 
the symmetry, all the arcs so obtained have the same distribution. Now, tear the circle 
at one of the points. It must be true for the resulting segment with n marked points 
that all its pieces have the same length distribution. 


See Riddles and 
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9.10 | Determinants in Z, 238 251 
9.11 | Gambling in a Company 238 252 
9.12 | Probability in the World Series 239 254 
9.13 | Probability of a Meet in an 239 255 
Elimination Tournament 
9.14 | Rolling a Die 239 257 
9.15 | Fair Duel 239 259 
10.1 | Five-Digit Numbers Divisible by Eleven | 261 264 
10.2. | The Odds for Two Teams 261 264 
10.3 | Cutting a String into Unequal Pieces 261 264 
10.4 | Getting an Arithmetic Progression 261 265 
with Three Dice 
10.5 | Probability of No Distinct 261 265 
Positive Roots 
10.6 | Rolling a Defective Die 262 266 
10.7 | Counting Coins 262 266 
10.8 |The Roads We Take 262 267 
10.9 | Marbles of Four Colors 262 271 
10.10 | Two Modified Dice 263 271 
10.11 | Exchanging Balls of Random Colors 263 271 
11.1 |A Question about Median 273 274 
11.2 | Family Statistics 203 274 
11.3 | How to Ask an Embarrassing Question | 273 275 
11.4 | An Integer Sequence with Given 273 27h 
Statistical Parameters 
11.5 | Mean, Mode, Range and Median 273 278 


